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CONSTRUCT VALIDITY IN PSYCHOLOGICAL TESTS 


LEE J. CRONBACH 


PAUL E. MEEHL! 


AND 


University of Illinois 


Validation of psychological tests 
has not yet been adequately concep- 
tualized, as the APA Committee on 
Psychological Tests learned when it 
undertook (1950-54) to specify what 
qualities should be investigated be- 
fore a test is published. In order to 
make coherent recommendations the 
Committee found it necessary to dis- 
tinguish four types of validity, estab- 
lished by different types of research 
and requiring different interpreta- 
tion. The chief innovation in the 
Committee’s report was the term con- 
struct validity.2 This idea was first 
formulated by a_ subcommittee 
(Meehl and R. C. Challman) study- 
ing how proposed recommendations 
would apply to projective techniques, 
and later modified and clarified by 
the entire Committee (Bordin, Chall- 
man, Conrad, Humphreys, Super, 
and the present writers). The state- 
ments agreed upon by the Commit- 
tee (and by committees of two other 
associations) were published in the 
Technical Recommendations (59). The 
present interpretation of construct 
validity is not “official’’ and deals 


! The second author worked on this prob- 
lem in connection with his appointment to the 
Minnesota Center for Philosophy of Science. 
We are indebted to the other members of the 
Center (Herbert Feigl, Michael Scriven, 
Wilfrid Sellars), and to D. L. Thistlethwaite 
of the University of Illinois, for their major 
contributions to our thinking and their sug- 
gestions for improving this paper. 

? Referred to in a preliminary report (58) 
as congruent validity. 


University of Minnesota 


with some areas where the Committee 
would probably not be unanimous. 
The present writers are solely respon- 
sible for this attempt to explain the 
concept and elaborate its implica- 
tions. 

Identification of construct validity 
was not an isolated development. 
Writers on validity during the pre- 
ceding decade had shown a great deal 
of dissatisfaction with conventional 
notions of validity, and introduced 
new terms and ideas, but the result- 
ing aggregation of types of validity 
seems only to have stirred the muddy 
waters. Portions of the distinctions 
we shall discuss are implicit in Jen- 
kins’ paper, ‘Validity for what?” 
(33), Gulliksen’s ‘‘Intrinsic validity” 
(27), Goodenough’s distinction be- 
tween tests as “signs” and “samples” 
(22), Cronbach's separation of “‘logi- 
cal” and “empirical” validity (11), 
Guilford’s “factorial validity’’ (25), 
and Mosier’s papers on “‘face valid- 
ity” and “validity generalization” 
(49, 50). Helen Peak (52) comes 
close to an explicit statement of con- 
struct validity as we shall present it. 


Four TYPES OF VALIDATION 


The categories into which the Rec- 
ommendations divide validity studies 
are: predictive validity, concurrent 
validity, content validity, and con- 
struct validity. The first two of these 
may be considered together as cri- 
terion-oriented validation procedures. 

The pattern of a criterion-oriented 
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study is familiar. The investigator is 
primarily interested in some criterion 
which he wishes to predict. He ad- 
ministers the test, obtains an inde- 
pendent criterion measure on the 
same subjects, and computes a cor- 
relation. If the criterion is obtained 
some time after the test is given, he is 
studying predictive validity. If the 
test score and criterion score are de- 
termined at essentially the same time, 
he is studying concurrent validity. 
Concurrent validity is studied when 
one test is proposed as a substitute 
for another (for example, when a 
multiple-choice form of spelling test 
is substituted for taking dictation), 
or a test is shown to correlate with 
some contemporary criterion (e.g., 
psychiatric diagnosis). 

Content validity is established by 
showing that the test items are a sam- 
ple of a universe in which the investi- 
gator is interested. Content validity 
is ordinarily to be established de- 
ductively, by defining a universe of 
items and sampling systematically 
within this universe to establish the 
test. 

Construct validation is involved 
whenever a test is to be interpreted 
as a measure of some attribute or 
quality which is not “operationally 
defined.”” The problem faced by the 
investigator is, ‘What constructs 
account for variance in test perform- 
ance?”’ Construct validity calls for 
no new scientific approach. Much 
current research on tests of personal- 
ity (9) is construct validation, usu- 
ally without the benefit of a clear 
formulation of this process. 

Construct validity is not tobe iden- 
tified solely by particular investiga- 
tive procedures, but by the orienta- 
tion of the investigator. Criterion- 
oriented validity, as Bechtoldt em- 
phasizes (3, p. 1245), “involves the 
acceptance of a set of operations as an 
adequate definition of whatever is to 
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be measured.’’ When an investigator 
believes that no criterion available to 
him is fully valid, he perforce be- 
comes interested in construct validity 
because this is the only way to avoid 
the “infinite frustration” of relating 
every criterion to some more ultimate 
standard (21). In content validation, 
acceptance of the universe of content 
as defining the variable to be meas- 
ured is essential. Construct validity 
must be investigated whenever no 
criterion or universe of content is 
accepted as entirely adequate to de- 
fine the quality to be measured. De- 
termining what psychological con- 
structs account for test performance 
is desirable for almost any test. Thus, 
although the MMPI was originally 
established on the basis of empirical 
discrimination between patient 
groups and so-called normals (con- 
current validity), continuing research 
has tried to provide a basis for de- 
scribing the personality associated 
with each score pattern. Such inter- 
pretations permit the clinician to pre- 
dict performance with respect to cri- 
teria which have not yet been em- 
ployed in empirical validation studies 
(cf. 46, pp. 49-50, 110-111). 


We can distinguish among the four types 
of validity by noting that each involves a 
different emphasis on the criterion. In pre- 
dictive or concurrent validity, the criterion 
behavior is of concern to the tester, and he 
may have mo concern whatsoever with the 
type of behavior exhibited in the test. (An 
employer does not care if a worker can mani- 
pulate blocks, but the score on the block 
test may predict something he cares about.) 
Content validity is studied when the tester 
is concerned with the type of behavior in- 
volved in the test performance. Indeed, if the 
test is a work sample, the behavior repre- 
sented in the test may be an end in itself. 
Construct validity is ordinarily studied when 
the tester has no definite criterion measure 
of the quality with which he is concerned, and 
must use indirect measures. Here the trait or 
quality underlying the test is of central im- 
portance, rather than either the test behavior 
or the scores on the criteria (59, p. 14). 
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Construct validation is important 

at times for every sort of psychologi- 
cal test: aptitude, achievement, in- 
terests, and so on. Thurstone’s state- 
ment is interesting in this connec- 
tion: 
In the field of intelligence tests, it used to be 
common to define validity as the correlation 
between a test score and some outside cri- 
terion. We have reached a stage of sophistica- 
tion where the test-criterion correlation is 
too coarse. It is obsolete. If we attempted to 
ascertain the validity of a test for the second 
space-factor, for example, we would have to 
get judges [to] make reliable judgments about 
people as to this factor. Ordinarily their 
[the available judges’] ratings would be of 
no value as a criterion. Consequently, validity 
studies in the cognitive functions now depend 
on criteria of internal consistency... (60, 
p. 3). 


Construct validity would be involved 
in answering such questions as: To 
what extent is this test of intelligence 
culture-free? Does this test of ‘‘inter- 
pretation of data’’ measure reading 
ability, quantitative reasoning, or re- 
sponse sets? How does a person with 


A in Strong Accountant, and B in 
Strong CPA, differ from a person who 
has these scores reversed? 

Example of construct validation pro- 
cedure. Suppose measure X correlates 
.50 with Y, the amount of palmar 
sweating induced when we tell a stu- 
dent that he has failed a Psychology 
I exam. Predictive validity of X for 
Y is adequately described by the co- 
efficient, and a statement of the ex- 
perimental and sampling conditions. 
If someone were to ask, “Isn’t there 
perhaps another way to interpret this 
correlation?’’ or ‘‘What other kinds 
of evidence can you bring to support 
your interpretation?’’, we would 
hardly understand what he was ask- 
ing because no interpretation has 
been made. These questions become 
relevant when the correlation is ad- 
vanced as evidence that “test X 
measures anxiety proneness.”’ Alter- 
native interpretations are possible; 
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e.g., perhaps the test measures “‘aca- 
demic aspiration,’’ in which case we 
will expect different results if we in- 
duce palmar sweating by economic 
threat. It is then reasonable to in- 
quire about other kinds of evidence. 

Add these facts from further stud- 
ies: Test X correlates .45 with fra- 
ternity brothers’ ratings on ‘‘tense- 
ness.” Test X correlates .55 with 
amount of intellectual inefficiency in- 
duced by painful electric shock, and 
.68 with the Taylor Anxiety scale. 
Mean X score decreases among four 
diagnosed groups in this order: anxi- 
ety state, reactive depression, ‘‘nor- 
mal,’’ and psychopathic personality. 
And palmar sweat under threat of 
failure in Psychology I correlates .60 
with threat of failure in mathematics. 
Negative results eliminate competing 
explanations of the X score; thus, 
findings of negligible correlations be- 
tween X and social class, vocational 
aim, and value-orientation make it 
fairly safe to reject the suggestion 
that X measures “academic aspira- 
tion."’ We can have substantial con- 
fidence that X does measure anxiety 
proneness if the current theory of 
anxiety can embrace the variates 
which yield positive correlations, and 
does not predict correlations where 
we found none. 


KINDS OF CONSTRUCTS 


At this point we should indicate 
summarily what we mean by a con- 
struct, recognizing that much of the 
remainder of the paper deals with 
this question. A construct is some 
postulated attribute of people, as- 
sumed to be reflected in test perform- 
ance. In test validation the attribute 
about which we make statements in 
interpreting a test is a construct. We 
expect a person at any time to possess 
or not possess a qualitative attribute 
(amnesia) or structure, or to possess 
some degree of a quantitative attrib- 
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ute (cheerfulness). A construct has 
certain associated meanings carried 
in statements of this general charac- 
ter: Persons who possess this attri- 
bute will, in situation X, act in manner 
Y (with a stated probability). The 
logic of construct validation is in- 
voked whether the construct is highly 
systematized or loose, used in rami- 
fied theory or a few simple proposi- 
tions, used in absolute propositions 
or probability statements. We seek 
to specify how one is to defend a pro- 
posed interpretation of a test; we 
are not recommending any one type of 
inter pretation. 

The constructs in which tests are 
to be interpreted are certainly not 
likely to be physiological. Most often 


they will be traits such as “‘latent hos- © 


tility’ or ‘‘variable in mood,”’ or de- 
scriptions in terms of an educational 
objective, as “ability to plan experi- 
ments.” For the benefit of readers 
who may have been influenced by cer- 
tain eisegeses of MacCorquodale 
and Meeh! (40), let us here empha- 
size: Whether or not an interpreta- 
tion of a test's properties or relations 
involves questions of construct valid- 
ity is to be decided by examining the 
entire body of evidence offered, to- 
gether with what is asserted about 
the test in the context of this evi- 
dence. Proposed identifications of 
constructs allegedly measured by the 
test with constructs of other sciences 
(e.g., genetics, neuroanatomy, bio- 
chemistry) make up only one class 
of construct-validity claims, and a 
rather minor one at present. Space 
does not permit full analysis of the 
relation of the present paper to the 
MacCorquodale-Meehl distinction 
between hypothetical constructs and 
intervening variables. The philoso- 
phy of science pertinent to the pres- 
ent paper is set forth later in the sec- 
tion entitled, ‘‘The nomological net- 
work.” 
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THE RELATION OF CONSTRUCTS 
TO “CRITERIA” 


Critical View of the Criterion Implied 


An unquestionable criterion may 
be found in a practical operation, or 
may be established as a consequence 
of an operational definition. Typi- 
cally, however, the psychologist is un- 
willing to use the directly operational 
approach because he is interested in 
building theory about a generalized 
construct. A theorist trying to relate 
behavior to “hunger’’ almost cer- 
tainly invests that term with mean- 
ings other than the _ operation 
“elapsed-time-since-feeding.’’ If he 
is concerned with hunger as a tissue 
need, he will not accept time lapse as 
equivalent to his construct because it 
fails to consider, among other things, 
energy expenditure of the animal. 

In some situations the criterion is 
no more valid than the test. Sup- 
pose, for example, that we want to 
know if counting the dots on Bender- 
Gestalt figure five indicates ‘‘com- 
pulsive rigidity,’”’ and take psychia- 
tric ratings on this trait as a criterion. 
Even a conventional report on the re- 
sulting correlation will say something 
about the extent and intensity of the 
psychiatrist's contacts and should 
describe his qualifications (e.g., dip- 
lomate status? analyzed?). 

Why report these facts? Because 
data are needed to indicate whether 
the criterion is any good. ‘“‘Compul- 
sive rigidity” is not really intended to 
mean “social stimulus value to psy- 
chiatrists."”. The implied trait in- 
volves a range of behavior-disposi- 
tions which may be very imperfectly 
sampled by the psychiatrist. Sup- 
pose dot-counting does not occur ina 
particular patient and yet we find 
that the psychiatrist has rated him as 
“rigid.’’ When questioned the psy- 


chiatrist tells us that the patient was 
a rather easy, free-wheeling sort; 
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however, the patient did lean over to 
straighten out a skewed desk blotter, 
and this, viewed against certain 
other facts, tipped the scale in favor 
of a “‘rigid’’ rating. On the face of it, 
counting Bender dots may be just as 
good (or poor) a sample of the com- 
pulsive-rigidity domain as straighten- 
ing desk blotters is. 

Suppose, to extend our example, we 
have four tests on the ‘predictor’ 
side, over against the psychiatrist’s 
“criterion,’’ and find generally posi- 
tive correlations among the five vari- 
ables. Surely it is artificial and arbi- 
trary to impose the “‘test-should-per- 
dict-criterion’”’ pattern on such data. 
The psychiatrist samples verbal con- 
tent, expressive pattern, voice, pos- 
ture, etc. The psychologist samples 
verbal content, perception, expres- 
sive pattern, etc. Our proper con- 


clusion is that, from this evidence, 
the four tests and the psychiatrist all 
assess some common factor. 

The asymmetry between the ‘‘test”’ 


and the so-designated ‘criterion’ 
arises only because the terminology 
of predictive validity has become a 
commonplace in test analysis. In 
this study where a construct is the 
central concern, any distinction be- 
tween the merit of the test and cri- 
terion variables would be justified 
only if it had already been shown 
that the psychiatrist’s theory and 
operations were excellent measures 
of the attribute. 


INADEQUACY OF VALIDATION IN 
TERMS OF SPECIFIC CRITERIA 


The proposal to validate construc- 
tual interpretations of tests runs 
counter to suggestions of some others. 
Spiker and McCandless (57) favor 
an operational approach. Validation 
is replaced by compiling statements 
as to how strongly the test predicts 
other observed variables of interest. 
To avoid requiring that each new 
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variable be investigated completely 
by itself, they allow two variables to 
collapse into one whenever the prop- 
erties of the operationally defined 
measures are the same: “If a new 
test is demonstrate’ to predict the 
scores on an older, well-established 
test, then an evaluation of the predic- 
tive power of the older test may be 
used for the new one.’’ But accurate 
inferences are possible only if the two 
tests correlate so highly that there 
is negligible reliable variance in either 
test, independent of the other. Where 
the correspondence is less close, one 
must either retain all the separate 
variables operationally defined or em- 
bark on construct validation. 

The practical user of tests must 

rely on constructs of some generality 
to make predictions about new situa- 
tions. Test X could be used to pre- 
dict palmar sweating in the face of 
failure without invoking any con- 
struct, but a counselor is more likely 
to be asked to forecast behavior in 
diverse or even unique situations for 
which the correlation of test X is un- 
known. Significant predictions rely 
on knowledge accumulated around 
the generalized construct of anxiety. 
The Technical Recommendations 
state: 
It is ordinarily necessary to evaluate construct 
validity by integrating evidence from many 
different sources. The problem of construct 
validation becomes especially acute in the 
clinical field since for many of the constructs 
dealt with it is not a question of finding an 
imperfect criterion but of finding any criterion 
at all. The psychologist interested in con- 
struct validity for clinical devices is concerned 
with making an estimate of a hypothetical 
internal process, factor, system, structure, 
or state and cannot expect to find a clear 
unitary behavioral criterion. An attempt to 
identify any one criterion measure or any 
composite as the criterion aimed at is, however, 
usually unwarranted (59, p. 14-15). 


This appears to conflict with argu- 
ments for specific criteria prominent 
at places in the testing literature. 
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Thus Anastasi (2) makes many state- 
ments of the latter character: ‘‘It is 
only as a measure of a specifically 
defined criterion that a test can be 
objectively validated at all... To 
claim that a test measures anything 
over and above its criterion is pure 
speculation” (p. 67). Yet elsewhere 
this article supports construct valida- 
tion. Tests can be profitably inter- 
preted if we “know the relationships 
between the tested behavior . . . and 
other behavior samples, none of these 
behavior samples necessarily occupy- 
ing the preeminent position of a cri- 
terion” (p. 75). Factor analysis with 
several partial criteria might be used 
to study whether a test measures a 
postulated ‘‘general learning ability.” 
If the data demonstrate specificity of 
ability instead, such specificity is 
“useful in its own right in advancing 
our knowledge of behavior; it should 
not be construed as a weakness of the 
tests’ (p. 75). 

We depart from Anastasi at two 
points. She writes, ‘“The validity of 
a psychological test should not be 
confused with an analysis of the fac- 
tors which determine the behavior 
under consideration.’’ We, however, 
regard such analysis as a most im- 
portant type of validation. Second, 
she refers to “the will-o’-the-wisp of 
psychological processes which are 
distinct from performance”’ (2, p. 77). 
While we agree that psychological 
processes are elusive, we are sympa- 
thetic to attempts to formulate and 
clarify constructs which are evi- 
denced by performance but distinct 
from it. Surely an inductive inference 
based on a pattern of correlations 
cannot be dismissed as ‘‘pure specu- 
lation.” 


Specific Criteria Used Temporarily: 
The ‘‘Bootstraps” Effect 


Even when a test is constructed on 
the basis of a specific criterion, it may 
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ultimately be judged to have greater 
construct validity than the criterion. 
We start with a vague concept which 
we associate with certain observa- 
tions. We then discover empirically 
that these observations covary with 
some other observation which pos- 
sesses greater reliability or is more in- 
timately correlated with relevant ex- 
perimental changes than is the orig- 
inal measure, or both. For example, 
the notion of temperature arises be- 
cause some objects feel hotter to the 
touch than others. The expansion of 
a mercury column does not have face 
validity as an index of hotness. But 
it turns out that (a) there is a statis- 


-tical relation between expansion and 


sensed temperature; (b) observers 
employ the mercury method with 
good interobserver agreement; (c) 
the regularity of observed relations 
is increased by using the thermometer 
(e.g., melting points of samples of the 
same material vary little on the ther- 
mometer; we obtain nearly linear re- 
lations between mercury measures 
and pressure of a gas). Finally, (d) 
a theoretical structure involving un- 
observable microevents—the kinetic 
theory—is worked out which explains 
the relation of mercury expansion to 
heat. This whole process of concep- 
tual enrichment begins with what in 
retrospect we see as an extremely fal- 
lible ‘‘criterion’’—the human tem- 
perature sense. That original criter- 
ion has now been relegated to a pe- 
ripheral position. We have lifted our- 
selves by our bootstraps, but in a 
legitimate and fruitful way. 
Similarly, the Binet scale was first 
valued because children’s scores 
tended to agree with judgments by 
schoolteachers. If it had not shown 
this agreement, it would have been 
discarded along with reaction time 
and the other measures of ability pre- 
viously tried. Teacher judgments 


once constituted the criterion against 
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which the individual intelligence test 
was validated. But if today a child’s 
IQ is 135 and three of his teachers 
complain about how stupid he is, we 
do not conclude that the test has 
failed. Quite to the contrary, if no 
error in test procedure can be argued, 
we treat the test score asa valid state- 
ment about an important quality, 
and define our task as that of finding 
out what other variables—person- 
ality, study skills, etc.—modify 
achievement or distort teacher judg- 
ment. 


EXPERIMENTATION TO INVESTI- 
GATE CONSTRUCT VALIDITY 


Validation Procedures 


We can use many methods in con- 
struct validation. Attention should 
particularly be drawn to Macfar- 
- lane’s survey of these methods as they 
apply to projective devices (41). 

Group differences. If our under- 


standing of a construct leads us to 


expect two groups to differ on the 
test, this expectation may be tested 
directly. Thus Thurstone and Chave 
validated the Scale for Measuring 
Attitude Toward the Church by show- 
ing score differences between church 
members and nonchurchgoers. 
Churchgoing is not the criterion of 
attitude, for the purpose of the test is 
to measure something other than the 
crude sociological fact of church at- 
tendance; on the other hand, failure 
to find a difference would have seri- 
ously challenged the test. 

Only coarse correspondence be- 
tween test and group designation is 
expected. Too great a correspondence 
between the two would indicate that 
the test is to some degree invalid, be- 
cause members of the groups are ex- 
pected to overlap on the test. Intel- 
ligence test items are selected initially 
on the basis of a correspondence to 
age, but an item that correlates .95 
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with age in an elementary school 
sample would surely be suspect. 
Correlation matrices and factor an- 
alysis. If two tests are presumed to 
measure the same construct, a cor- 
relation between them is predicted. 
(An exception is noted where some 
second attribute has positive loading 
in the first test and negative loading 
in the second test; then a low correla- 
tion is expected. This is a testable 
interpretation provided an external 
measure of either the first or the sec- 


ond variable exists.) If the obtained 


correlation departs from the expecta- 
tion, however, there is no way to 
know whether the fault lies in test A, 
test B, or the formulation of the con- 
struct. A matrix of intercorrelations 
often points out profitable ways of 
dividing the construct into more 
meaningful parts, factor analysis be- 
ing a useful computational method 
in such studies. 

Guilford (26) has discussed the 
place of factor analysis in construct 
validation. His statements may be 
extracted as follows: 

“The personnel psychologist wishes 
to know ‘why his tests are valid.’ He 
can place tests and practical criteria 
in a matrix and factor it to identify 
‘real dimensions of human person- 
ality.’ A factorial description is ex- 
act and stable; it is economical in 
explanation; it leads to the creation 
of pure tests which can be combined 
to predict complex behaviors.” It is 
clear that factors here function as 
constructs. Eysenck, in his ‘‘criterion 
analysis’ (18), goes farther than Guil- 
ford, and shows that factoring can be 
used explicitly to test hypotheses 
about constructs. 

Factors may or may not be 
weighted with surplus meaning. Cer- 
tainly when they are regarded as 
“real ‘dimensions’ a great deal of 
surplus meaning is implied, and the 
interpreter must shoulder a substan- 
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tial burden of proof. The alternative 
view is to regard factors as defining a 
working reference frame, located in a 
convenient manner in the ‘‘space”’ 
defined by all behaviors of a given 
type. Which set of factors from a 
given matrix is “most useful’ will 
depend partly on predilections, but 
in essence the best construct is the 
one around which we can build the 
greatest number of inferences, in the 
most direct fashion. 

Studies of internal structure. For 
many constructs, evidence of homo- 
geneity within the test is relevant in 
judging validity. If a trait such as 
dominance is hypothesized, and the 
items inquire about behaviors sub- 
sumed under this label, then the hy- 
pothesis appears to require that these 
items be generally intercorrelated. 
Even low correlations, if consistent, 
would support the argument that 
people may be fruitfully described in 
terms of a generalized tendency to 
dominate or not dominate. The gen- 
eral quality would have power to pre- 
dict behavior in a variety of situa- 
tions represented by the specific 
items. Item-test correlations and 
certain reliability formulas describe 
internal consistency. 

It is unwise to list uninterpreted 
data of this sort under the heading 
“‘validity”’ in test manuals, as some 
authors have done. High internal 
consistency may lower validity. Only 
if the underlying theory of the trait 
being measured calls for high item 
intercorrelations do the correlations 
support construct validity. Negative 
item-test correlations may support 
construct validity, provided that the 
items with negative correlations are 
believed irrelevant to the postulated 
construct and serve as suppressor 
variables (31, p. 431-436; 44). 

Study of distinctive subgroups of 
items within a test may set an upper 
limit to construct validity by showing 
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that irrelevant elements influence 
scores. Thus a study of the PMA 
space tests shows that variance can 
be partially accounted for by a re- 
sponse set, tendency to mark many 
figures as similar (12). An internal 
factor analysis of the PEA Interpre- 
tation of Data Test shows that in ad- 
dition to measuring reasoning skills, 
the test score is strongly influenced 
by a tendency to say “‘probably true”’ 
rather than ‘‘certainly true,’’ regard- 
less of item content (17). On the 
other hand, a study of item groupings 
in the DAT Mechanical Comprehen- 
sion Test permitted rejection of the 
hypothesis that knowledge about 
specific topics such as gears made a 
substantial contribution to scores 
(13). 

Studies of change over occasions. 
The stability of test scores (‘‘retest 
reliability,’’ Cattell’s ‘‘N-technique’’) 
may be relevant to construct valida- 
tion. Whether a high degree of sta- 
bility is encouraging or discouraging 
for the proposed interpretation de- 
pends upon the theory defining the 
construct. 

More powerful than the retest after 
uncontrolled intervening experiences 
is the retest with experimental in- 
tervention. If a transient influence 
swings test scores over a wide range, 
there are definite limits on the extent 
to which a test result can be inter- 
preted as reflecting the typical be- 
havior of the individual. These are 
examples of experiments which have 
indicated upper limits to test valid- 
ity: studies of differences associated 
with the examiner in projective test- 
ing, of change of score under alterna- 
tive directions (‘tell the truth” vs. 
“make yourself look good to an em- 
ployer’), and of coachability of 
mental tests. We may recall Gullik- 
sen’s distinction (27): When the 


coaching is of a sort that improves 
the pupil’s intellectual functioning in 
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school, the test which is affected by 
the coaching has validity as a meas- 
ure of intellectual functioning; if the 
coaching improves test taking but 
not school performance, the test 
which responds to the coaching has 
poor validity as a measure of this 
construct. 

Sometimes, where differences be- 
tween individuals are difficult to 
assess by any means other than the 
test, the experimenter validates by 
determining whether the test can de- 
tect induced intra-individual differ- 
ences. One might hypothesize that 
the Zeigarnik effect is a measure of 
ego involvement, i.e., that with ego 
involvement there is more recall of 
incomplete tasks. To support such 
an interpretation, the investigator 
will try to induce ego involvement on 
some task by appropriate directions 
and compare subjects’ recall with 
their recall for tasks where there was 
a contrary induction. Sometimes the 
intervention is drastic. Porteus finds 
(53) that brain-operated patients 
show disruption of performance on 
his maze, but do not show impaired 
performance on conventional verbal 
tests and argues therefrom that his 
test is a better measure of planfulness. 

Studies of process. One of the best 
ways of determining informally what 
accounts for variability on a test is 
the observation of the person’s pro- 
cess of performance. If it is supposed, 
for example, that a test measures 
mathematical competence, and yet 
observation of students’ errors shows 
that erroneous reading of the ques- 
tion is common, the implications of a 
low score are altered. Lucas in this 
way showed that the Navy Relative 
Movement Test, an aptitude test, 
actually involved two different abili- 
ties: spatial visualization and mathe- 
matical reasoning (39). 

Mathematical analysis of scoring 
procedures may provide important 
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negative evidence on construct valid- 
ity. A recent analysis of “empathy” 
tests is perhaps worth citing (14). 
“Empathy” has been operationally 
defined in many studies by the ability 
of a judge to predict what responses 
will be given on some questionnaire 
by a subject he has observed briefly. 
A mathematical argument has shown, 
however, that the scores depend on 
several attributes of the judge which 
enter into his perception of amy in- 
dividual, and that they therefore can- 
not be interpreted as evidence of his 
ability to interpret cues offered by 
particular others, or his intuition. 


The Numerical Estimate of 
Construct Validity 


There is an understandable tend- 
ency to seek a “construct validity 
coefficient.””. A numerical statement 
of the degree of construct validity 
would be a statement of the propor- 
tion of the test score variance that is 
attributable to the construct variable. 
This numerical estimate can some- 
times be arrived at by a factor analy- 
sis, but since present methods of fac- 
tor analysis are based on linear rela- 
tions, more general methods will 
ultimately be needed to deal with 
many quantitative problems of con- 
struct validation. 

Rarely will it be possible to esti- 
mate definite “construct satura- 
tions,’’ because no factor correspond- 
ing closely to the construct will be 
available. One can only hope to set 
upper and lower bounds to the “‘load- 
ing.”’ Ii “creativity” is defined as 
something independent of knowledge, 
then a correlation of .40 between a 
presumed test of creativity and a test 
of arithmetic knowledge would indi- 
cate that at least 16 per cent of the 
reliable test variance is irrelevant to 
creativity as defined. Laboratory 
performance on problems such as 
Maier’s “hatrack’’ would scarcely be 
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an ideal measure of creativity, but it 
would be somewhat relevant. If its 
correlation with the test is .60, this 
permits a tentative estimate of 36 
per cent as a lower bound. (The esti- 
mate is tentative because the test 
might overlap with the irrelevant por- 
tion of the laboratory measure.) The 
saturation seems to lie between 36 
and 84 per cent; a cumulation of stud- 
ies would provide better limits. 

It should be particularly noted 
that rejecting the null hypothesis 
does not finish the job of construct 
validation (35, p. 284). The problem 
is not to conclude that the test “‘is 
valid’”’ for measuring the construct 
variable. The task is to state as defi- 
nitely as possible the degree of valid- 
ity the test is presumed to have. 


Tue Locic or CONSTRUCT 
VALIDATION 


Construct validation takes place 
when an investigator believes that 
his instrument reflects a particular 
construct, to which are attached cer- 
tain meanings. The proposed inter- 
pretation generates specific testable 
hypotheses, which are a means of 
confirming or disconfirming the claim. 
The philosophy of science which we 
believe does most justice to actual 
scientific practice will now be briefly 
and dogmatically set forth. Readers 
interested in further study of the 
philosophical underpinning are re- 
ferred to the works by Braithwaite 
(6, especially Chapter III), Carnap 
(7; 8, pp. 56-69), Pap (51), Sellars 
(55, 56), Feigl (19, 20), Beck (4), 
Kneale (37, pp. 92-110), Hempel 
(29; 30, Sec. 7). 


The Nomological Net 


The fundamental principles are 
these: 

1. Scientifically 
“make clear what something is 
means to set forth the laws in which 


speaking, to 


a 
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it occurs. We shall refer to the inter- 
locking system of laws which consti- 
tute a theory as a nomological network. 

2. The laws in a nomological net- 
work may relate (a) observable prop- 
erties or quantities to each other; or 
(6) theoretical constructs to observa- 
bles; or (c) different theoretical con- 
structs to one another. These “‘laws”’ 
may be statistical or deterministic. 

3. A necessary condition for a con- 
struct to be scientifically admissible 
is that it occur in a nomological net, 
at least some of whose laws involve 
observables. Admissible constructs 
may be remote from observation, i.e., 
a long derivation may intervene be- 
tween the nomologicals which im- 
plicitly define the construct, and the 
(derived) nomologicals of type a. 
These latter propositions permit pre- 
dictions about events. The construct 
is not ‘‘reduced”’ to the observations, 
but only combined with other ¢on- 
structs in the net to make predictions 
about observables. 

4. “Learning more about” a the- 
oretical construct is a matter of elab- 
orating the nomological network in 
which it occurs, or of increasing the 
definiteness of the components. At 
least in the early history of a con- 
struct the network will be limited, 
and the construct will as yet have 
few connections. 

5. An enrichment of the net such 
as adding a construct or a relation to 
theory is justified if it generates nom- 
ologicals that are confirmed by ob- 
servation or if it reduces the number 
of nomologicals required to predict 
the same observations. When ob- 
servations will not fit into the net- 
work as it stands, the scientist has a 
certain freedom in selecting where to 
modify the network. That is, there 
may be alternative constructs or ways 
of organizing the net which for the 
time being are equally defensible. 

6. We can say that ‘‘operations’’ 
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which are qualitatively very different 
“overlap” or ‘“‘measure the same 
thing’ if their positions in the nomo- 
logical net tie them to the same con- 
struct variable. Our confidence in 
this identification depends upon the 
amount of inductive support we have 
for the regions of the net involved. 
It is not necessary that a direct ob- 
servational comparison of the two 
operations be made—we may be con- 
tent with an intranetwork proof in- 
dicating that the two operations 
yield estimates of the same network- 
defined quantity. Thus, physicists 
are content to speak of the “‘tempera- 
ture’”’ of the sun and the “‘tempera- 
ture’ of a gas at room temperature 
even though the test operations are 
nonoverlapping because this identifi- 
cation makes theoretical sense. 
With these statements of scientif- 
ic methodology in mind, we return 
to the specific problem of construct 
validity as applied to psychological 
tests. The preceding guide rules 


should reassure the ‘‘toughminded,”’ 
who fear that allowing construct 
validation opens the door to noncon- 
firmable test claims. The answer is 
that unless the network makes con- 
tact with observations, and exhibits 


explicit, public steps of inference, 
construct validation cannot be 
claimed. An admissible psychological 
construct must be behavior-relevant 
(59, p. 15). For most tests intended 
to measure constructs, adequate cri- 
teria do not exist. This being the 
case, many such tests have been left 
unvalidated, or a finespun network 
of rationalizations has been offered 
as if it were validation. Rationaliza- 
tion is not construct validation. One 
who claims that his test reflects a 
construct cannot maintain his claim 
in the face of recurrent negative re- 
sults because these results show that 
his construct is too loosely defined to 
yield verifiable inferences. 
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A rigorous (though perhaps prob- 
abilistic) chain of inference is re- 
quired to establish a test as a measure 
of a construct. To validate a claim 
that a test measures a construct, a 
nomological net surrounding the con- 
cept must exist. When a construct is 
fairly new, there may be few speci- 
fiable associations by which to pin 
down the concept. As research pro- 
ceeds, the construct sends out roots 
in many directions, which attach it 
to more and more facts or other con- 
structs. Thus the electron has more 
accepted properties than the neu- 
trino; numerical ability has more than 
the second space factor. 

““Acceptance,”’ which was critical 
in criterion-oriented and _ content 
validities, has now appeared in con- 
struct validity. Unless substantially 
the same nomological net is accepted 
by the several users of the construct, 
public validation is impossible. If A 
uses aggressiveness to mean overt as- 
sault on others, and B’s usage in- 
cludes repressed hostile reactions, 
evidence which convinces B that a 
test measures aggressiveness con- 
vinces A that the test does not. 
Hence, the investigator who proposes 
to establish a test as a measure of a 
construct must specify his network or 
theory sufficiently clearly that others 
can accept or reject it (cf. 41, p. 406). 
A consumer of the test who rejects 
the author’s theory cannot accept 
the author’s validation. He must 
validate the test for himself, if he 
wishes to show that it represents the 
construct as he defines it. 

Two general qualifications are in 
order with reference to the methodo- 
logical principles 1-6 set forth at 
the beginning of this section. Both 
of them concern the amount of 
‘“‘theory,”’ in any high-level sense of 
that word, which enters into a con- 
struct-defining network of laws or 
lawlike statements. We do not wish 
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to convey the impression that one al- 
ways has a very elaborate theoretical 
network, rich in hypothetical proc- 
esses or entities. 

Constructs as inductive summaries. 
In the early stages of development of 
a construct or even at more advanced 
stages when our orientation is thor- 
oughly practical, little or no theory 
in the usual sense of the word need be 
involved. In the extreme case the hy- 
pothesized laws are formulated en- 
tirely in terms of descriptive (obser- 
vational) dimensions although not all 
of the relevant observations have 
actually been made. 

The hypothesized network “goes 
beyond the data”’ only in the limited 
sense that it purports to characterize 
the behavior facets which belong to 
an observable but as yet only par- 
tially sampled cluster; hence, it gen- 
erates predictions about hitherto un- 
sampled regions of the phenotypic 
space. Even though no unobserva- 


bles or high-order theoretical con- 


structs are introduced, an element of 
inductive extrapolation appears in 
the claim that a cluster including 
some elements not-yet-observed has 
been identified. Since, as in any sort- 
ing or abstracting task involving a 
finite set of complex elements, sev- 
eral nonequivalent bases of cate- 
gorization are available, the investi- 
gator may choose a hypothesis which 
generates erroneous predictions. The 
failure of a supposed, hitherto un- 
tried, member of the cluster to be- 
have in the manner said to be charac- 
teristic of the group, or the finding 
that a nonmember of the postulated 
cluster does behave in this manner, 
may modify greatly our tentative 
construct. 

For example, one might build an 
intelligence test on the basis of his 
background notions of “‘intellect,” 
including vocabulary, arithmetic cal- 
culation, general information, simi- 
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larities, two-point threshold, reaction 
time, and line bisection as subtests. 
The first four of these correlate, and 
he extracts a huge first factor. This 
becomes a second approximation of 
the intelligence construct, described 
by its pattern ot loadings on the four 
tests. The other three tests have 
negligible loading on any common 
factor. On this evidence the investi- 
gator reinterprets intelligence as 
“manipulation of words.’’ Subse- 
quently it is discovered that test- 
stupid people are rated as unable to 
express their ideas, are easily taken in 
by fallacious arguments, and misread 
complex directions. These data sup- 
port the “‘linguistic’’ definition of in- 
telligence and the test’s claim of 
validity for that construct. But then 
a block design test with pantomime 
instructions is found to be strongly 
saturated with the first factor. Im- 
mediately the purely ‘‘linguistic’’ in- 
terpretation of Factor I becomes sus- 
pect. This finding, taken together 
with our initial acceptance of the 
others as relevant to the background 
concept of intelligence, forces us to 
reinterpret the concept once again. 

If we simply list the tests or traits 
which have been shown to be satur- 
ated with the ‘factor’ or which be- 
long to the cluster, no construct is 
employed. As soon as we even sum- 
marize the properties of this group of 
indicators—we are already making 
some guesses. Intensional characteri- 
zation of a domain is hazardous since 
it selects (abstracts) properties and 
implies that new tests sharing those 
properties will behave as do. the 
known tests in the cluster, and that 
tests not sharing them will not. 

The difficulties in merely ‘‘charac- 
terizing the surface cluster” are strik- 
ingly exhibited by the use of certain 
special and extreme groups for pur- 
poses of construct validation. The 
P, scale of MMPI was originally de- 
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rived and cross-validated upon hos- 
pitalized patients diagnosed ‘‘Psy- 
chopathic personality, asocial and 
amoral type”’ (42). Further research 
shows the scale to have a limited de- 
gree of predictive and concurrent 
validity for ‘“delinquency’’ more 
broadly defined (5,28). Several stud- 
ies show associations between P, 
and very special ‘‘criterion’’ groups 
which it would be ludicrous to iden- 
tify as “the criterion” in the tradi- 
tional sense. If one lists these hetero- 
geneous groups and tries to charac- 
terize them intensionally, he faces 
enormous conceptual difficulties. For 
example, a recent survey of hunting 
accidents in Minnesota showed that 
hunters who had “carelessly’’ shot 
someone were significantly elevated 
on Pg when compared with other 
hunters (48). This is in line with one’s 
theoretical expectations; when you 
ask MMPI “experts’”’ to predict for 
such a group they invariably predict 
P,or M, or both. The finding seems 


therefore to lend some slight support 
to the construct validity of the Pg 


scale. But of course it would be non- 
sense to define the Pg component 
“operationally” in terms of, say, ac- 
cident proneness. We might try to 
subsume the original phenotype and 
the hunting-accident proneness under 
some broader category, such as “‘Dis- 
position to violate society's rules, 
whether legal, moral, or just sensible.” 
But now we have ceased to have a 
neat operational criterion, and are 
using instead a rather vague and 
wide-range class. Besides, there is 
worse to come. We want the class 
specification to cover a group trend 
that (nondelinquent) high school stu- 
dents judged by their peer group as 
least ‘‘responsible’’ score over a full 
sigma higher on P, than those judged 
most ‘‘responsible’’ (23, p. 75). Most 
of the behaviors contributing to such 
sociometric choices fall well within 
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the range of socially permissible ac- 
tion; the proffered criterion specifica- 
tion is still toorestrictive. Again, any 
clinician familiar with MMPI lore 
would predict an elevated P, on a 
sample of (nondelinquent) profes- 
sionalactors. Chyatte’s confirmation 
of this prediction (10) tends to sup- 
port both: (a) the theory sketch of 
“what the P, factor is, psychologi- 
cally”; and (6) the claim of the P, 
scale to construct validity for this 
hypothetical factor. Let the reader 
try his hand at writing a brief pheno- 
typic criterion specification that will 
cover both trigger-happy hunters 
and Broadway actors! And if he 
should be ingenious enough to achieve 
this, does his definition also encom- 
pass Hovey’s report that high Pg 
predicts the judgments “not shy” 
and “unafraid of mental patients” 
made upon nurses by their supervi- 
sors (32, p. 143)? And then we have 
Gough's report that low Pg is asso- 
ciated with ratings as “good-natured” 
(24, p. 40), and Roessell’s data show- 
ing that high Pg, is predictive of 
“dropping out of high school” (54). 
The point is that all seven of these 
“criterion” dispositions would be 
readily guessed by any clinician hav- 
ing even superficial familiarity with 
MMPI interpretation; but to medi- 
ate these inferences explicitly re- 
quires quite a few hypotheses about 
dynamics, constituting an admittedly 
sketchy (but far from vacuous) net- 
work defining the genotype psycho- 
pathic deviate. 

Vagueness of present psychological 
laws. This line of thought leads di- 
rectly to our second important quali- 
fication upon the network schema. 
The idealized picture is one of a tidy 
set of postulates which jointly entail 
the desired theorems; since some of 
the theorems are coordinated to the 
observation base, the system consti- 
tutes an implicit definition of the 
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theoretical primitives and gives them 
an indirect empirical meaning. In 
practice, of course, even the most ad- 
vanced physical sciences only approx- 
imate this ideal. Questions of ‘‘cate- 
goricalness’’ and the like, such as 
logicians raise about pure calculi, are 
hardly even statable for empirical 
networks. (What, for example, would 
be the desiderata of a “well-formed 
formula” in molar behavior theory?) 
Psychology works with crude, half- 
explicit formulations. We do not 
worry about such advanced formal 
questions as “‘whether all molar-be- 
havior statements are decidable by 
appeal to the postulates’ because 
we know that no existing theoretical 
network suffices to predict even the 
known descriptive laws. Neverthe- 
less, the sketch of a network is there; 
if it were not, we would not be saying 
anything intelligible about our con- 
structs. We do not have the rigorous 
implicit definitions of formal calculi 
(which still, be it noted, usually per- 
mit of a multiplicity of interpreta- 
tions). Yet the vague, avowedly in- 
complete network still gives the con- 
structs whatever meaning they do 
have. When the network is very in- 
complete, having many strands miss- 
ing entirely and some constructs tied 
in only by tenuous threads, then the 
“implicit definition’ of these con- 
structs is disturbingly loose; one 
might say that the meaning of the 
constructs is underdetermined. Since 
the meaning of theoretical constructs 
1s set forth by stating the laws in which 
they occur, our incomplete knowledge 
of the laws of nature produces a vague- 
ness in our constructs (see Hempel, 
30; Kaplan, 34; Pap, 51). We will be 
able to say “‘what anxiety is’’ when 
we know all of the laws involving it; 
meanwhile, since we are in the pro- 
cess of discovering these laws, we do 
not yet know precisely what anxiety 
is. 
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CONCLUSIONS REGARDING THE NET- 
WORK AFTER EXPERIMENTATION 


The proposition that x per cent of 
test variance is accounted for by the 
construct is inserted into the accepted 
network. The network then generates 
a testable prediction about the rela- 
tion of the test scores to certain other 
variables, and the investigator gath- 
ers data. If prediction and result are 
in harmony, he can retain his belief 
that the test measures the construct. 
The construct is at best adopted, 
never demonstrated to be “correct.” 

We do not first “prove” the theory, 
and then validate the test, nor con- 
versely. In any probable inductive 
type of inference from a pattern of ob- 
servations, we examine the relation 
between the total network of theory 
and observations. The system in- 
volves propositions relating test to 
construct, construct to other con- 
structs, and finally relating some of 
these constructs to observables. In 
ongoing research the chain of infer- 
ence is very complicated. Kelly and 
Fiske (36, p. 124) give a complex dia- 
gram showing the numerous infer- 
ences required in validating a predic- 
tion from assessment techniques, 
where theories about the criterion sit- 
uation are as integral a part of the 
prediction as are the test data. A pre- 
dicted empirical relationship permits 
us to test all the propositions leading 
to that prediction. Traditionally the 
proposition claiming to interpret the 
test has been set apart as the hypoth- 
esis being tested, but actually the evi- 
dence is significant for all parts of the 
chain. If the prediction is not con- 
firmed, any link in the chain may be 
wrong. 

A theoretical network can be di- 
vided into subtheories used in making 
particular predictions, All the events 
successfully predicted through a sub- 
theory are of course evidence in favor 
of that theory. Such a subtheory 
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may be so well confirmed by volumi- 
nous and diverse evidence that we can 
reasonably view a particular experi- 
ment as relevant only to the test’s 
validity. If the theory, combined 
with a proposed test interpretation, 
mispredicts in this case, it is the latter 
which must be abandoned. On the 
other hand, the accumulated evidence 
for a test’s construct validity may be 
so strong that an instance of mispre- 
diction will force us to modify the 
subtheory employing the construct 
rather than deny the claim that the 
test measures the construct. 

Most cases in psychology today lie 
somewhere between these extremes. 
Thus, suppose we fail to find a 
greater incidence of “homosexual 
signs” in the Rorschach records of 
paranoid patients. Which is more 
strongly disconfirmed—the  Ror- 
schach signs or the orthodox theory 
of paranoia? The negative finding 
shows the bridge between the two to 
be undependable, but this is all we 
can say. The bridge cannot be used 
unless one end is placed on solider 
ground, The investigator must de- 
cide which end it is best to relocate. 

Numerous successful predictions 
dealing with phenotypically diverse 
“criteria”’ give greater weight to the 
claim of construct validity than do 
fewer predictions, or predictions in- 
volving very similar behaviors. In 
arriving at diverse predictions, the 
hypothesis of test validity is con- 
nected each time to a subnetwork 
largely independent of the portion 
previously used. Success of these 
derivations testifies to the inductive 
power of the test-validity statement, 
and renders it unlikely that an 
equally effective alternative can be 
offered. 


Implications of Negalive Evidence 


The investigator whose prediction 
and data are discordant must make 
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strategic decisions. His result can be 
interpreted in three ways: 

1. The test does not measure the 
construct variable. 

2. The theoretical network which 
generated the hypothesis is incorrect. 

3. The experimental design failed 
to test the hypothesis properly. 
(Strictly speaking this may be an- 
alyzed as a special case of 2, but in 
practice the distinction is worth mak- 
ing.) 

For further research. If a specific 
fault of procedure makes the third a 
reasonable possibility, his proper re- 
sponse is to perform an adequate 
study, meanwhile making no report. 
When faced with the other two alter- 
natives, he may decide that his test 
does not measure the construct ade- 
quately. Following that decision, he 
will perhaps prepare and validate a 
new test. Any rescoring or new inter- 
pretative procedure for the original 
instrument, like a new test, requires 
validation by means of a fresh body of 
data. 

The investigator may regard inter- 
pretation 2 as more likely to lead to 
eventual advances. It is legitimate 
for the investigator to call the net- 
work defining the construct into ques- 
tion, if he has confidence in the test. 
Should the investigator decide that 
some step in the network is unsound, 
he may be able to invent an alterna- 
tive network. Perhaps he modifies 
the network by splitting a concept 
into two or more portions, e.g., by 
designating types of anxiety, or per- 
haps he specifies added conditions 
under which a generalization holds. 
When an investigator modifies the 
theory in such a manner, he is now re- 
quired to gather a fresh body of data to 
test the altered hypotheses. This step 
should normally precede publication 
of the modified theory. If the new 
data are consistent with the modified 
network, he is free from the fear that 
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his nomologicals were gerrymandered 
to fit the peculiarities of his first sam- 
ple of observations. He can now trust 
his test to some extent, because his 
test results behave as predicted. 

The choice among alternatives, like 
any strategic decision, is a gamble as 
to which course of action is the best 
investment of effort. Is it wise to 
modify the theory? That depends on 
how well the system is confirmed by 
prior data, and how well the modifi- 
cations fit available observations. Is 
it worth while to modify the test in 
tne hope that it will fit the construct? 
That depends on how much evidence 
there is—apart from this abortive ex- 
periment—to support the hope, and 
also on how much it is worth to the 
investigator's ego to salvage the test. 
The choice among alternatives is a 
matter of research planning. 

For practical use of the test. The 
consumer can accept a test as a meas- 
ure of a construct only when there is a 
strong positive fit between predic- 
tions and subsequent data. When 
the evidence from a proper investiga- 
tion of a published test is essentially 
negative, it should be reported as a 
stop sign to discourage use of the test 
pending a reconciliation of test and 
construct, or final abandonment of 
the test. If the test has not been pub- 
lished, it should be restricted to re- 
search use until some degree of 
validity is established (1). The con- 
sumer can await the results of the in- 
vestigator’s gamble with confidence 
that proper application of the scien- 
tific method will ultimately tell 
whether the test has value. Until the 
evidence is in, he has no justification 
for employing the test as a basis for 
terminal decisions. The test may 
serve, at best, only as a source of sug- 
gestions about individuals to be con- 
firmed by other evidence (15, 47). 

There are two perspectives in test 
validation. From the viewpoint of 
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the psychological practitioner, the 
burden of proof is on the test. A test 
should not be used to measure a trait 
until its proponent establishes that 
predictions made from such measures 
are consistent with the best available 
theory of the trait. In the view of the 
test developer, however, both the 
test and the theory are under scru- 
tiny. He is free to say to himself pri- 
vately, ‘‘If my test disagrees with the 
theory, so much the worse for the 
theory.”” This way lies delusion, un- 
less he continues his research using a 
better theory. 


Reporting of Positive Results 


The test developer who finds posi- 
tive correspondence between his pro- 
posed interpretation and data is ex- 
pected to report the basis for his 
validity claim. Defending a claim of 
construct validity is a major task, not 
to be satisfied by a discourse without 
data. The Technical Recommenda- 
tions have little to say on reporting of 
construct validity. Indeed, the only 
detailed suggestions under that head- 
ing refer to correlations of the test 
with other measures, together with a 
cross reference to some other sections 
of the report. The two key princi- 
ples, however, call for the most com- 
prehensive type of reporting. The 
manual for any test “should report all 
available information which will as- 
sist the user in determining what psy- 
chological attributes account for vari- 
ance in test scores’’ (59, p. 27). And, 
“The manual for a test which is used 
primarily to assess postulated attri- 
butes of the individual should outline 
the theory on which the test is based 
and organize whatever partial valid- 
ity data there are to show in what 
way they support the theory’’ (59, 
p. 28). It is recognized, by a classifi- 
cation as ‘“‘very desirable” rather than 
“essential,” that the latter recom- 








CONSTRUCT VALIDITY 


mendation goes beyond present prac- 
tice of test authors. 

The proper goals in reporting con- 
struct validation are to make clear 
(a) what interpretation is proposed, 
(6) how adequately the writer be- 
lieves this interpretation is substan- 
tiated, and (c) what evidence and rea- 
soning lead him to this belief. With- 
out a the construct validity of the 
test is of no use to the consumer. 
Without 6 the consumer must carry 
the entire burden of evaluating the 
test research. Without c the con- 
sumer or reviewer is being asked to 
take a and b on faith. The test man- 
ual cannot always present an exhaus- 
tive statement on these points, but 
it should summarize and indicate 
where complete statements may be 
found. 

To specify the interpretation, the 
writer must state what construct he 
has in mind, and what meaning he 
gives to that construct. For a con- 
struct which has a short history and 
has built up few connotations, it will 
be fairly easy to indicate the pre- 
sumed properties of the construct, 
i.e., the nomologicals in which it ap- 
pears. For a construct with a longer 
history, a summary of properties and 
references to previous theoretical dis- 
cussions may be appropriate. It is 
especially critical to distinguish pro- 
posed interpretations from other 
meanings previously given the same 
construct. The validator faces no 
small task; he must somehow com- 
municate a theory to his reader. 

To evaluate his evidence calls for a 
statement like the conclusions from a 
program of research, noting what is 
well substantiated and what alterna- 
tive interpretations have been con- 
sidered and rejected. The writer must 
note what portions of his proposed 
interpretation are speculations, ex- 
trapolations, or conclusions from in- 
sufficient data. The author has an 
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ethical responsibility to prevent un- 
substantiated interpretations from 
appearing as truths. A claim is un- 
substantiated unless the evidence for 
the claim is public, so that other sci- 
entists may review the evidence, criti- 
cize the conclusions, and offer alterna- 
tive interpretations. 

The report of evidence in a test 
manual must be as complete as any 
research report, except where ade- 
quate public reports can be cited. 
Reference to something “‘observed by 
the writer in many clinical cases” is 
worthless as evidence. Full case re- 
ports, on the other hand, may be a 
valuable source of evidence so long as 
these cases are representative and 
negative instances receive due atten- 
tion. The report of evidence must be 
interpreted with reference to the 
theoretical network in such a manner 
that the reader sees why the author 
regards a particular correlation or ex- 
periment as confirming (or throwing 
doubt upon) the proposed inter- 
pretation. Evidence collected by 
others must be taken fairly into ac- 
count. 


VALIDATION OF A COMPLEX TEST 
“As a WHOLE” 


Special questions must be consid- 
ered when we are investigating the 
validity of a test which is aimed to 
provide information about several 
constructs. In one sense, it is naive 
to inquire “Is this test valid?’’ One 
does not validate a test, but only a 
principle for making inferences. If a 
test yields many different types of 
inferences, some of them can be valid 
and others invalid (cf. Technical 
Recommendation C2: “The manual 
should report the validity of each 
type of inference for which a test is 
recommended”). From this point of 
view, every topic sentence in the 
typical book on Rorschach inter- 
pretation presents a hypothesis re- 
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quiring validation, and one should 
validate inferences about each aspect 
of the personality separately and in 
turn, just as he would want informa- 
tion on the validity (concurrent or 
predictive) for each scale of MMPI. 

There is, however, another defensi- 
ble point of view. If a test is purely 
empirical, based strictly on observed 
connections between response to an 
item and some criterion, then of 
course the validity of one scoring key 
for the test does not make validation 
for its other scoring keys any less 
necessary. But a test may be de- 
veloped on the basis of a theory which 
in itself provides a linkage between 
the various keys and the various cri- 
teria. Thus, while Strong’s Voca- 
tional Interest Blank is developed 
empirically, it also rests on a ‘‘the- 
ory” that a youth can be expected to 
be satisfied in an occupation if he has 
interests common to men now happy 
in the occupation. When Strong finds 
that those with high Engineering in- 
terest scores in college are prepon- 
derantly in engineering careers 19 
years later, he has partly validated 
the proposed use of the Engineer 
score (predictive validity). Since the 
evidence is consistent with the theory 
on which all the test keys were built, 
this evidence alone increases the pre- 
sumption that the other keys have 
predictive validity. How strong is 
this presumption? Not very, from 
the viewpoint of the traditional skep- 
ticism of science. Engineering in- 
terests may stabilize early, while in- 
terests in art or management or social 
work are still unstable. A claim can- 
not be made that the whole Strong 
approach is valid just because one 
score shows predictive validity. But 
if thirty interest scores were investi- 
gated longitudinally and all of them 
showed the type of validity predicted 
by Strong’s theory, we would indeed 
be caviling to say that this evidence 
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gives no confidence in the long-range 
validity of the thirty-first score. 

Confidence in a theory is increased 
as more relevant evidence confirms 
it, but it is always possible that to- 
morrow’s investigation will render the 
theory obsolete. The Technical Rec- 
ommendations suggest a rule of rea- 
son, and ask for evidence for each 
type of inference for which a test is 
recommended. It is stated that no 
test developer can present predictive 
validities for all possible criteria; 
similarly, no developer can run all 
possible experimental tests of his 
proposed interpretation. But the rec- 
ommendation is more subtle than ad- 
vice that a lot of validation is better 
than a little. 

Consider the Rorschach test. It is 
used for many inferences, made by 
means of nomological networks at 
several levels. At a low level are the 
simple unrationalized correspond- 
ences presumed to exist between cer- 
tain signs and psychiatric diagnoses. 
Validating such a sign does nothing to 
substantiate Rorschach theory. For 
other Rorschach formulas an explicit 
a priori rationale exists (for instance, 
high F% interpreted as implying 
rigid control of impulses). Each time 
such a sign shows correspondence 
with criteria, its rationale is sup- 
ported just a little. At a still higher 
level of abstraction, a considerable 
body of theory surrounds the general 
area of outer control, interlacing 
many different constructs. As evi- 
dence cumulates, one should be able 
to decide what specific inference- 
making chains within this system 
can be depended upon. One should 
also be able to conclude—or deny— 
that so much of the system has stood 
up under test that one has some con- 
fidence in even the untested lines in 
the network. 

In addition to relatively delimited 
nomological networks surrounding 
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control or aspiration, the Rorschach 
interpreter usually has an overriding 
theory of the test as a whole. This 
may be a psychoanalytic theory, a 
theory of perception and set, or a the- 
ory stated in terms of learned habit 
patterns. Whatever the theory of the 
interpreter, whenever he validates an 
inference from the system, he obtains 
some reason for added confidence in 
his overriding system. His total the- 
ory is not tested, however, by experi- 
ments dealing with only one limited 
set of constructs. The test developer 
must investigate far-separated, inde- 
pendent sections of the network. The 
more diversified the predictions the 
system is required to make, the 
greater confidence we can have that 
only minor parts of the system will 
later prove faulty. Here we begin to 
glimpse a logic to defend the judg- 
ment that the test and its whole inter- 
pretative system is valid at some level 
of confidence. 

There are enthusiasts who would 
conclude from the foregoing para- 
graphs that since there is some evi- 
dence of correct, diverse predictions 
made from the Rorschach, the test 
as a whole can now be accepted as 
validated. This conclusion overlooks 
the negative evidence. Just one find- 
ing contrary to expectation, based on 
sound research, is sufficient to wash a 
whole theoretical structure away. 
Perhaps the remains can be salvaged 
to form a new structure. But this 
structure now must be exposed to 
fresh risks, and sound negative evi- 
dence will destroy it in turn. There 
is sufficient negative evidence to pre- 
vent acceptance of the Rorschach and 
its accompanying interpretative 
structures as a whole. So long as any 
aspects of the overriding theory 
stated for the test have been discon- 
firmed, this structure must be rebuilt. 

Talk of areas and structures may 
seem not to recognize those who 
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would interpret the personality ‘‘glo- 
bally.”’ They may argue that a test is 
best validated in matching studies. 
Without going into detailed questions 
of matching methodology, we car. ask 
whether such a study validates the 
nomological network ‘‘as a whole.” 
The judge does employ some network 
in arriving at his conception of his 
subject, integrating specific infer- 
ences from specific data. Matching 
studies, if successful, demonstrate 
only that each judge’s interpretative 
theory has some validity, that it is 
not completely a fantasy. Very high 
consistency between judges is re- 
quired to show that they are using 
the same network, and very high suc- 
cess in matching is required to show 
that the network is dependable. 

If inference is less than perfectly 
dependable, we must know which as- 
pects of the interpretative network 
are least dependable and which are 
most dependable. Thus, even if one 
has considerable confidence in a test 
“as a whole”’ because of frequent suc- 
cessful inferences, one still returns as 
an ultimate aim to the request of the 
Technical Recommendation for sep- 
arate evidence on the validity of each 
type of inference to be made. 


RECAPITULATION 


Construct validation was intro- 
duced in order to specify types of re- 
search required in developing tests 
for which the conventional views on 
validation are inappropriate. Per- 
sonality tests, and some tests of abil- 
ity, are interpreted in terms of attri- 
butes for which there is no adequate 
criterion. This paper indicates what 
sorts of evidence can substantiate 
such an interpretation, and how such 
evidence is to be interpreted. The 
following points made in the discus- 
sion are particularly significant. 

1. A construct is defined implicitly 
by a network of associations or propo- 
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sitions in which it occurs. Constructs 
employed at different stages of re- 
search vary in definiteness. 

2. Construct validation is possible 
only when some of the statements in 
the network lead to predicted rela- 
tions among observables. While some 
observables may be regarded as “‘cri- 
teria,’’ the construct validity of the 
criteria themselves is regarded as 
under investigation. 

3. The network defining the con- 
struct, and the derivation leading to 
the predicted observation, must be 
reasonably explicit so that validating 
evidence may be properly inter- 
preted. 

4. Many types of evidence are rele- 
vant to construct validity, including 
content validity, interitem correla- 
tions, intertest correlations, test-‘‘cri- 
terion” correlations, studies of sta- 
bility over time, and stability under 
experimental intervention. High cor- 
relations and high stability may con- 
stitute either favorable or unfavor- 
able evidence for the proposed inter- 
pretation, depending on the theory 
surrounding the construct. 

5. When a predicted relation fails 
to occur, the fault may lie in the pro- 
posed interpretation of the test or in 
the network. Altering the network 
so that it can cope with the new ob- 
servations is, in effect, redefining the 
construct. Any such new interpreta- 
tion of the test must be validated by 
a fresh body of data before being ad- 
vanced publicly. Great care is re- 
quired to avoid substituting a pos- 
teriori rationalizations for proper val- 
idation. 
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6. Construct validity cannot gen- 
erally be expressed in the form of a 
single simple coefficient. The data 
often permit one to establish upper 
and lower bounds for the proportion 
of test variance which can be attri- 
buted to the construct. The integra- 
tion of diverse data into a proper in- 
terpretation cannot be an entirely 
quantitative process. 

7. Constructs may vary in nature 
from those very close to “pure de- 
scription”’ (involving little more than 
extrapolation of relations among ob- 
servation-variables) to highly the- 
oretical constructs involving hypoth- 
esized entities and processes, or mak- 
ing identifications with constructs of 
other sciences. 

8. The investigation of a test's con- 
struct validity is not essentially dif- 
ferent from the general scientific pro- 
cedures for developing and confirming 
theories. 


Without in the least advocating con- 
struct validity as preferable to the 
other three kinds (concurrent, predict- 
ive, content), we do believe it im- 
perative that psychologists make a 
place for it in their methodological 
thinking, so that its rationale, its 
scientific legitimacy, and its dangers 
may become explicit and familiar. 
This would be preferable to the wide- 
spread current tendency to engage 
in what actually amounts to con- 
struct validation research and use of 
constructs in practical testing, while 
talking an “‘operational’’ method- 
ology which, if adopted, would force 
research into a mold it does not fit. 
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It is the purpose of this paper to 
present a general picture of psychol- 
ogy in the Arab Near East. Particu- 
lar attention will be given to places 
and institutions where most of the 
psychological activity is concen- 
trated, the type of work carried on 
in each of those places, and persons 
who seem to be leaders in the field. 
The emphasis is on the contemporary 
scene, but background factors are 
introduced where they seem to shed 
light on the current situation. 

There is some disagreement among 
scholars on the inclusiveness of the 
terms ‘‘Near East’”’ and ‘‘Middle 
East."” We have included Egypt, 


Lebanon, Syria, Iraq, Jordan, Saudi 
Arabia, Bahrein Island, and Kuwait 
under the phrase ‘‘Arab Near East.” 


Some British writers use ‘‘Arab Mid- 
dle East”’ to refer to the same area. 
The population of this region is ap- 
proximately 40 million, of which 
about half is in Egypt. The total 
area is more than 1,600,000 square 
miles. Thus the region is about one 
fourth as populous, and more than 
half as large, as the continental 
United States. Population density 
is considerably greater than these 
figures would suggest, for well over 
three-fourths of the area is desert 
and the population is concentrated 
in the fertile, watered areas. 

The recorded political history of 
the Near East goes back thousands 
of years. We shall not attempt to 
trace the evolvement of these coun- 
tries to their present form. It is 
sufficient here to note that all are 
independent sovereign nations ex- 
cept Bahrein and Kuwait. These 
two are British protectorates. 


The information presented in this 
paper is based principally on first- 
hand observation and personal inter- 
views by the authors. The senior 
author has taught in the Near East 
for more than two years. The junior 
author was born in the Near East 
and has spent most of his life there. 
Preliminary information about psy- 
chology in the area was gathered 
through correspondence and informal 
discussions. When it was decided 
to make an intensive survey of the 
area, information was solicited by 
mail from all known psychologists in 
each country. On the basis of this 
information, interviews were sched- 
uled in the spring of 1953 with leading 
psychologists in each country. One 
or both of us then visited each state, 
talked at length with the selected 
psychologists, and personally in- 
spected laboratories, libraries, class- 
rooms, etc. Visits were not made 
to Kuwait, Bahrein, or Saudi Arabia 
because we knew of no psychologists 
there. For information about these 
countries we relied on the statements 
of educators from them who visited 
Lebanon, and on the report of Dr. 
Habib Kurani, who inspected some 
of their educationai facilities in 1952. 
Because of the relative importance of 
Egyptian psychology, additional in- 
formation about work there was ob- 
tained through the kind cooperation 
of Dr. Hanna Rizk of the American 
University in Cairo. He prepared 
and circulated a detailed question- 
naire to all Egyptians engaged in full 
or part-time psychological work. 
From the returned questionnaires we 
were able to get an unusually com- 
plete picture of psychology in Egypt. 
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GENERAL CHARACTERISTICS 

Psychology in the Arab Near East 
sprang from and is nourished by 
Western psychology. British, French, 
and American influences can be seen 
—and distinguished—in the work of 
psychologists throughout the area. 
The kind and strength of these in- 
fluences is determined by several 
factors: the nature of Arab culture, 
the social and political forces exerted 
by Western nations, and the compara- 
tive vigor of the several transplanted 
psychologies. Consequently Near 
Eastern psychology is of interest not 
only for its own sake, but also as a 
field for the study of forces affecting 
the development of various psycho- 
logical interests and viewpoints. 

In the days of the Ottoman Em- 
pire, the French school system be- 
came the model for Arab schools. 
With the breakup of that empire in 
World War I, French influence gained 
in the Levant but schools of the 
British type began to flourish in 
Iraq, Jordan, Palestine, and Egypt; 
today government schools still follow 
French and British organizational 
patterns. 

In Syria, Lebanon, and Pales- 
tine there are many private schools 
which were established by Western 
missionaries before the Ottomans 
created any local schools. The 
private schools with foreign support 
continue to exercise great influence 
on the education of the area. Con- 
sequently the Arab who begins his 
study of psychology in his own land, 
whether in public or private colleges, 
will approach the subject from a 
Western point of view. If he seeks 
graduate training, he will probably 
go to France, Britain, or the United 
States. 

There is remarkably little com- 
munication among psychologists of 
the area. Dependence on Western 
universities for advanced training 
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reduces the amount of contact among 
Arabs of different nationalities. A 
French-trained psychologist in Leb- 
anon may not read the journal 
published by a French-trained psy- 
chologist in Egypt, although he will 
be quite au courant of the publica- 
tions from Paris and Lyon. A British- 
trained psychologist in Damascus 
may know nothing of the work of 
the Baghdad psychologist who also 
received his doctorate in England. 
This isolation is augmented by severe 
travel restrictions imposed by many 
of the Arab states. Since there is 
little integration of psychological 


activities in this area, we shall deal 
with psychology in each of the several 
countries in turn. The order of pres- 
entation is based on our: judgment of 
the amount of psychological activity 
to be found in each country. 


Ecypt! 

There are approximately 40 per- 
sons in Egypt who are engaged full 
time in psychological work. More 
than 30 of these are engaged in aca- 
demic work, and most of the re- 
mainder are employed by the govern- 
ment. Two psychologists were cabi- 
net ministers at the time this survey 
was made (2). 

Two generalizations can be made 
about psychology in Egypt. First, 
the influence of British universities 
is very strong. Second, there is a 
close alliance between psychology 
and education. The second fact is 
probably related to the first. There 
are many individuals, of course, to 
whom these statements would not 
apply, and there is some evidence 
that British influence is less strong 
today than five years ago. Neverthe- 
less, more than half of the psychol- 


' We wish to express our appreciation to Dr. 
Hanna Rizk of the American University at 
Cairo for invaluable assistance in collecting 
data on Egyptian psychologists. 
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ogists in Egypt today received their 
advanced degrees from British in- 
stitutions, and British journals rank 
next to Egyptian journals as vehi- 
cles for publication of research data. 
Only two persons reported having 
published in American journals, and 
two reported having published in 
France. 

Nearly all of Egypt's psychologists 
are to be found in Cairo and Alexan- 
dria. A majority of the psycholo- 
gists engaged in academic work are 
in one or another of the Institutes of 
Education. The Higher Institute of 
Education for Men, now a part of 
Cairo’s Ibrahim University, has 11 
psychologists on its staff. Four psy- 
chologists are with the Higher Insti- 
tute of Education for Women in 
Cairo, two are with the Higher 
Institute of Education in Alexandria, 
and at least six more are on the staffs 
of the various normal schools. As 
might be expected from their insti- 


tutional affiliations, more psycholo- 
gists publish in the Egyptian Journal 
of Education than in the Egyptian 


Journal of Psychology. Indeed, this 
latter publication has_ recently 
changed from a quarterly to an an- 
nual, 

Dean of the Institute of Higher 
Education for Men, Cairo, is Dr. 
Abdel Aziz El Koussy, whose gradu- 
ate training was taken at Birmingham 
under Valentine and at London under 
Spearman and Burt. Specialists in 
clinical and developmental psychol- 
ogy, statistics, tests, and learning are 
on the Institute’s faculty. These 
specialists hold Ph.D. degrees (or 
the equivalent) from Cambridge, 
London, Reading, and Leeds. All 
undergraduate students at the Insti- 
tute take at least one course in psy- 
chology, and students in the newly 
organized graduate school take ap- 
proximately half of their course work 
in psychology. In 1952, 81 persons 
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were taking graduate work, and 24 
had passed the master’s examination. 
Of those who had passed, about 
one-third were actively working on 
a master’s thesis. The Institute is 
authorized to award both the M.A. 
and Ph.D. in educational psychology. 
At this writing no Ph.D. had been 
awarded. 

The laboratory at the Institute is 
similar to that in many American 
universities. Several rooms are 
available; each is furnished with 
worktables and cabinets filled with 
apparatus. The apparatus include 
chronoscopes, mazes, mirror-drawing 
devices, galvanometers, kymographs, 
manual dexterity boards, instru- 
ments designed to measure mechani- 
cal ability, intelligence tests, and a 
large vuriety of paper-and-pencil 
tests. Undergraduate students who 
take experimental psychology are 
expected to perform four experiments 
from a list of experiments provided 
by the instructor. Candidates for 
the master’s degree perform 20 ex- 
periments. English and American 
books serve as references. Foreign 
journa!s are not readily available to 
the students. 

The Institute offers some work in 
clinical psychology and there is a 
clinic connected with it. The de- 
mands for the services of the clinic 
far exceed its capacity. Patients 
are limited to children and students. 
Three professors devote full time to 
the clinic, and a neurologist, a psy- 
chiatrist, and a social worker are 
available on a part-time basis. Dr. 
Ishak Ramzy is in charge. He holds 
a Ph.D. from London and was 
trained in psychoanalysis at the 
London Institute of Psychoanalysis, 
where he studied with Anna Freud. 

There is an active testing move- 
ment in Egypt, and the Institute is 
one of the foci of this activity. 
Adaptations of the Binet and of 
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several of the Spearman tests have 
been in use for years. Pioneers and 
leaders in this area were Dr. Koussy 
and Dr. Ismael Kabbani, formerly 
with the Institute and now Minister 
of Education. At the present time 
the Army is using intelligence tests 
and other aptitude tests. 

The Faculty of Arts of Ibrahim 
University, Cairo, offers a license in 
psychology which is roughly equiva- 
lent to the American master of arts 
degree. Chairman of the department 
is Dr. Mustafa Ziwer who holds both 
the M.D. and Dr. es Let. from the 
University of Paris. The latter de- 
gree was earned under Piéron. Al- 
though the department is small, an 
imposing array of courses is offered 
by drawing on the staff of the Insti- 
tute of Higher Education and on the 
staff of the Faculty of Arts, Univer- 
sity of Cairo. The program requires 
four years beyond the baccalaureate 
deuxieme partie, and the first class 
to have completed this entire new 
program was graduated in 1954. It 
is expected that the graduates will be 
employed by the Ministries of Edu- 
cation and Social Welfare. Work to- 
ward a Ph.D. degree is to be offered 
in the future. 

At the University of Cairo, the 
psychology department is a part of 
the philosophy section of the Faculty 
of Arts. Students working toward a 
license in philosophy take one psy- 
chology course in each of the last 
three years. Chairman of the de- 
partment is Dr. Yusuf Mourad, a 
graduate of the University of Paris. 
He is co-editor, with Dr. Ziwer, of 
the Egyptian Journal of Psychology, 
and has contributed to it extensively. 
For his advanced courses he assigns 
works in French by Guillaume, 
Piéron, Wallon, and others, and 
French translations of such American 
authors as Woodworth, Carmichael, 
and Krech and Crutchfield. Courses 
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in experimental psychology are of- 
fered at Fouad by Dr. Muhammad 
Nagati, who did his graduate work 
at Yale University. 

The Higher Institute of Education 
for Women, Cairo, and the Alexan- 
dria Higher Institute of Education 
for Men both offer work in education- 
al psychology. Each of these insti- 
tutes has at least three full-time 
professors in this field. The American 
University at Cairo offers courses in 
psychology as part of its education 
and social studies curricula. Psy- 
chological training is also a part of the 
program given by the Cairo Insti- 
tute of Social Work. 

Al Azhar, Cairo, one of the world’s 
oldest universities, is traditionally 
devoted primarily to _ theological 
training. There are today three sec- 
tions at the higher level: theology, 
Islamic law, and Arabic language. 
The section devoted to the teaching 
of the Arabic language now offers 
courses in general and educational 
psychology, and the theology section 
offers one course in psychology. 


LEBANON? 


Since the time of the Crusades, 
and especially for the past hundred 
years, Lebanon has had close cul- 
tural ties with the West. At present 
one finds in the book stores of its 
cities large numbers of Western 
books, including scientific and pseu- 
doscientific works on psychology. 
Works on psychoanalysis are so 
popular that a leading Beirut news- 
paper ran in 1953 a series of editorials 
denouncing the “efforts of Western 
psychologists” to turn attention from 
social ills to problems of infantile 
frustration. There is even one dia- 
netics practitioner in Beirut who re- 


* We wish to express our appreciation to 
Miss Suad Khalluf for assistance in obtaining 
information about psychology in Lebanon. 
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ceived training from Hubbard him- 
self. 

There are three universities in 
Lebanon, and these universities are 
the foci of psychological activity in 
the country. The Centre d'Etudes 
Supérieures Francais de Beyrouth is 
a branch of the Arts and Sciences 
Faculty of the University of Lyon. 
It offers a license degree which re- 
quires that the student complete 
five certificates. The five can be 
obtained in two or three years. Of 
the many certificates which may be 
chosen, three are in the field of psy- 
chology: general, social, and educa- 
tional. Each certificate requires 
from six to eight hours per week for 
an academic year. The professors 
of the psychology courses hold chairs 
of philosophy. They place great 
emphasis on preparation for the an- 
nual examinations, which are issued 
from Lyon. They report, in familiar 
terms, that their teaching keeps them 
too busy for research. 


The first psychological work pub- 


lished in the Arabic language in 
modern times seems to have been 
"Al duriis al ’awaliyah fil falsafah a! 
‘gliyah (First Lessons in Mental 
Philosophy). It was written in 1874 
by the American missionary Daniel 
Bliss, who was founder and first 
president of the Syrian Protestant 
College (now the American Univer- 
sity of Beirut). Today the psychology 
department of the American Univer- 
sity resembles that of the liberal arts 
college in the United States. Courses 
are taught in English, and American 
textbooks are used. An eight-hour 
course in experimental and a three- 
hour course in statistics are required 
of all majors. The university em- 
phasizes research, and for more than 
a decade the department has con- 
centrated on research in social psy- 
chology. This tradition was begun 
by S. C. Dodd, now at the Univer- 
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sity of Washington, who was for many 
years chairman of the sociology and 
psychology departments. Closely 
allied with the American University 
is the Beirut College for Women. The 
College stresses work in develop- 
mental, guidance, and educational 
psychology. 

The Lebanese National University 
held its first classes in 1950, and the 
psychology program there is in an 
embryonic stage. Some courses are 
offered now and it is planned to offer 
a certificate in psychology in 1956. 
The organization and content of the 
courses reflect the influence of the 
French university and, to a lesser 
extent, of the American University. 


SYRIA 


Syrian psychology centers in the 
Syrian State University located in 
Damascus. The university is quite 
new. Although the Schools of Medi- 
cine and Law date back to 1923, it 
was only in 1946 that the Schools of 
Arts, Science, and Engineering, and 
the Higher Normal School, were 
joined with the older faculties (1). 
Some of the buildings occupied by 
the university are former Ottoman 
and French army barracks. Psychol- 
ogy courses are offered in both the 
philosophy faculty of the College of 
Arts and Sciences and in the educa- 
tion department of the Higher Nor- 
mal School. In order to obtain a 
license in the philosophy faculty, a 
student must study philosophy, psy- 
chology, and sociology. If he re- 
ceives the degree, he is entitled to 
teach these subjects in secondary 
schools. The secondary schools, like 
those of France, carry the students 
to approximately the American 
sophomore level. Of the four pro- 
fessors on the philosophy faculty, 
two are philosophers and the other 
two are psychologists on loan from 
the Higher Normal School. 
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The education department is quite 
active in psychology, and the mem- 
bers of its faculty offer a wide variety 
of courses. Chairman of the depart- 
ment is Dr. Fakhir Akil, who re- 
ceived B.A. and M.A. degrees from 
the American University of Beirut, 
and a Ph.D. from London. His 
doctoral work was done under Drs. 
Philpott and Burt. Each of the other 
full-time professors holds a license 
from Egypt and is completing a doc- 
torate in France. Consequently, 
Syrian psychology is a mixture of 
American, English, and French ap- 
proaches. The professors are writing 
textbooks in Arabic, and they assign 
collateral reading in both French 
and American works. Woodworth’s 
Experimental Psychology and Mor- 
gan’s Physiological Psychology are 
used in their French editions. The 
laboratory is well equipped with 
pursuit meters, mirror-drawing ap- 
paratus, form boards, intelligence 
test materials, personality test ma- 


terials (including Rorschach, TAT, 


Spearman Abilities, and Szondi), 
and many other devices. We were 
impressed to learn at the time of 
our visit to the laboratory that the 
annual budget for new laboratory 
equipment was approximately $4,500. 

Students achieving satisfactory 
scores on entrance examinations may 
study at the Higher Normal School 
at government expense. If they wish 
to accept the subsidy, they sign a 
contract with the Ministry of Educa- 
tion whereby they agree to work for 
a specific number of years for that 
ministry upon completion of their 
work. Moreover, the top student 
each year is sent abroad to work for 
a doctor’s degree. Partly as a result 
of this arrangement, there is strong 
student interest in educational psy- 
chology. The trend seems to be 
away from philosophical psychology, 
in the French tradition, toward 
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educational psychology of the British 
and American variety. This trend 
may be a function of Dr. Akil’s 
leadership, and it is too early to say 
whether it will continue when stu- 
dents now studying abroad return to 
Syria. Because of currency prob- 
lems, transportation costs, and other 
factors, more Syrians are studying 
in France than in Britain and Ameri- 
ca together (4). In Syria as in Egypt, 
there is more psychology work in the 
education department than in the 
philosophy faculty. 


IRAQ 


A visit to modern Baghdad gives 
one the impression that there is a 
lively curiosity about psychology in 
literate circles. | Bookstores stock 
and presumably sell many English- 
language psychology textbooks, even 
though these textbooks are not as- 
signed in any Iraqi university. Wood- 
worth’s Experimental Psychology, for 
example, is apparently read by some 
Iraqis out of sheer intellectual curi- 
osity! 

Although many of the professors 
are American educated, education 
in Iraq is dominated by persons 
trained in Britain. As in Egypt, 
educational psychology is taught in 
both urban and rural teacher training 
institutions. Teachers of these 
courses are persons who were trained 
primarily in the field of education. 

There is a department of education 
and psychology in the Higher Teach- 
er’s Training College, located in 
Baghdad. Several courses are offered 
in psychology, and there are plans for 
expansion. The principal members of 
the staff are Dr. Ibrahim Muhyi, 
who completed his doctorate at 
Columbia University, and Dr. Abdul 
Aziz Al-Bassam, whose graduate 
study was at University College, 
London. Each of these professors 
also offers courses at Baghdad's Col- 
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lege of Arts and Sciences and at the 
Queen Alliya College for Women. 
This last-named school offers psy- 
chology in the department of social 
work. As far as we know, there is no 
psychological laboratory in Iraq, and 
little psychological research. 


THE PERSIAN GULF AREA 


There is hardly any psychological 
activity on the West (Arab) side of 
the Persian Gulf. In Jordan there 
are two teacher training institutions 
where Egyptian-trained educators 


offer elementary work in psychology. 
There are no trained psychologists 
in Bahrein or Kuwait, and none in 
the Arabian peninsula except the 
Americans who are employed by the 
Arabia-American Oil Company. 


CONCLUSION 


From this brief survey it can be 
seen that psychology in the Arab 
Near East is essentially academic. 
Where the French academic tradition 
prevails, psychology is closely allied 
with philosophy. Where British 
academic tradition prevails, psychol- 
ogy is allied with education, and there 
is a strong emphasis on testing and 
measurement. There is some con- 
cern with research in Egypt and 
Lebanon, but this activity is subordi- 
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nate throughout the area to trans- 
lation of important Western books. 
and adaptation to local use of West- 
ern tests. 

In his discussion of ‘the sociology 
of psychology” Sanford (3, p. 85) has 
suggested that dictatorial regimes 
are antipsychological. Yet the mili- 
tary regimes in both Egypt and Syria 
seem eager to utilize whatever con- 
tributions psychology can make to 
the perceived social problems. Inter- 
est in psychology on the part of these 
governments exceeds that of the 
parliamentary governments which 
preceded them. This does not 
demonstrate that antidemocratic 
trends favor psychology, for the 
political form is not always the best 
index of the essential nature of a 
society. 

The prognosis is favorable for the 
growth of psychology in this part of 
Western Asia. Changes in long- 
established social institutions, in- 


creased urbanization, increased liter- 


acy, and rising per capita income all 
point to increased utilization of 
psychological understanding. There 
is some suspicion that psychology 
is an instrument of Western imperial- 
ism, but there is reason to hope that 
this suspicion will be overcome as 
increasing numbers of Arabs become 
psychologists. 


APPENDIX 
BIBLIOGRAPHY OF SELECTED TITLES 
There is no published bibliography of recent books in the Arabic language which deals with 
psychological topics. The best source of information about such books is probably 'Al Sijil al 
Thakafi (Cultural Register), published annually by the Egyptian Ministry of Education in 
Cairo. The register contains information about publications in many fields, including psychol- 


ogy and education. 


We have listec below some books which we consider to be representative of the best works 


on psychology published in Arabic from 1945 to 1953. 


inclusive. 


1. ‘Al "Ahwani, "Ahmad Fu’ad. [Outline of 
psychology.] Cairo: Amiriya 
1948. 

2. Al Athama, Wafic. [Modern psychology.] 
Damascus: Hashimiyah Press, 1950. 

3. Al Draubi, Sami, & ‘Al Jamali, Hafiz. 


Press, 


The list is representative rather than 


[Psychology.] Damascus: Dar-’al-Yaq- 
tha Press, 1949. 

4. Al Khawli, William. [The integrated per- 
sonality.) Cairo: Mukhaymar Press, 
1948, 


5. Al Qusi, Abdul Aziz. [Foundations of 
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mental health.} (2nd Ed.) Cairo: Al 
Nahda Press, 1952. 

6. Al Qusi, Abdul Aziz. [Foundations of 
Psychology.) Cairo: Al Nahda Library, 
1950. 

7. Al Qusi, Abdul Aziz, et al. [Language and 
thought.] Cairo: Amiriyah Press, 1948. 

8. Al Qusi, Abdul Aziz. [Statistics in educa- 
tion and psychology.] Cairo: Al Nahda 
Library, 1949. 

9. Al Mulayhi, Abdul Mun’im. [Psychologi- 
cal development.}] Cairo: Misr Library, 
1950. 

10. Al Tawil, Tawfic. 
Adab Press, 1945. 

11. "Akil, Fakhir. [Psychology and its appli- 
cation to education.] Damascus: 
Maktabat Al’uloom Wal’Adab, 1945. 

12. Barakat, Mohamad Khalifah. [Analysis 
of personality.] Cairo: Misr Library, 
1951. 

13. Barakat, Mohamad Khalifah. 
logical clinics.] 
1952. 


[Dreams.] Cairo: Al 


[Psycho- 
Cairo: Misr Library, 
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pathic behavior.] Cairo: Dar Al Ma’rif, 
1952. 

15. Khalaf Allah Muhammed. [A psychologi- 
cal approach to the study of literature 
and literary criticism.] Cairo: Lajnat 
Al ta’leaf Wal tarjamat Wal Nashr, 
1947, 

16. Mitri, Amin. [The feebleminded.] Alexan- 
dria: Dar Nashr 'Al Thaqafah, 1948. 

17. Murad, Yusuf. [Elements of general psy- 
chology.] Cairo: Dar Al Ma'rif Press, 
1948. 

18. Najati, Mohamad 'Uthman. [Military 
psychology.] Cairo: Dar Al Ma'rif, 
1953. 

19. Ramzi, Ishaq. [Individual psychology.] 
Cairo: Dar Al Ma‘rif Press, 1946. 

20. Saliba, Jamil. [Psychology.] Damascus: 
"Ibn Zaydoun Press, 1948. 

21. Salih, 'Ahmad. [Educational psychology.] 
Cairo: Al Nahda Library, 1951. 

22. Subay’i, Adnan. [A survey of child psy- 
chology.] Damascus: ’Arafah Library, 


14. Jirjis, Sabri. [The problem of psycho- 1952. 
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THE ROLE OF MOTIVATION IN VERBAL LEARNING 
AND PERFORMANCE! 
I. E. FARBER 
State University of Towa 


The factors affecting performance 
in learning situations are custom- 
arily divided into two classes—the 
associative and the nonassociative. 
The former have been the subject of 
most research in the area of verbal 
behavior, and the information con- 
cerning them constitutes by far the 
better part, in quality as well as 
quantity, of what is known about 
this kind of behavior. In recent 
years, however, there has been an 
increasing interest in those factors 
that are commonly supposed to have 
nonassociative properties, particu- 
larly those subsumed under the head- 
ing of “motivation.” 

Although the term ‘“‘motivational”’ 
is often used in reference to non- 


associative factors, conventional psy- 


chological usage imputes to motives 
an associative aspect also. Melton, 
for instance, considers motives as 
having three functions: the ener- 
gizing, the selective, and the direct- 
ing. According to his view, the 
energizing function “...may be 
most simply conceptualized as a 
result of the lowering of the general 
(italics added) reaction threshold by 
the motive’ (30, p. 672); the selec- 


1 The present paper is a revision of a paper 
entitled ‘Motivational factors in verbal learn- 
ing,”” presented May 8, 1953, in the Sym- 
posium on Psychology of Learning Basic to 
Military Training Problems, sponsored by the 
Panel on Training and Training Devices, 
Research and Development Board. It was 
prepared as part of a project on the influence 
of motivation on performance in learning, 
under Contract N9 onr-93802, Project NR 
154-107, between the State University of 
Iowa and the Office of Naval Research. The 
helpful criticisms of this manuscript by Drs. 
K. W. Spence and J. S. Brown are acknowl- 
edged with thanks. ‘ 


tive function refers to the reinforcing 
effects of the achievement of goals 
relevant to the motive; and the 
directing function is associative or 
cognitive, considering motives, as 
part of the stimulus complex, as 
having a given habit strength in 
respect to various specific responses. 
According to Shaffer (41), a motive 
involves both ‘‘drive’’ and ‘mech- 
anism,” the former term denoting a 
variable that arouses mass activity, 
and the latter, an acquired response 
tendency. Hull (23), in his discussion 
of primary motivation, supposed that 
while “‘... all drives alike are able 
to sensitize all habits’ (p. 247), 
“associated with every drive (D) 
is a characteristic drive stimulus 
(Sp)...” (p. 253), which gives 
specificity and direction to behavior. 
And Dollard and Miller (13), in 
discussing the functional properties 
of stimuli, refer to their drive as- 
pect-——that which impels action gen- 
erally, and their cue aspect—that 
which leads to differential response. 

The supposition that motivational 
variables have steering (associative) 
properties as well as energizing and 
reinforcing (drive) properties does 
not, of course, imply that the two 
kinds of properties are identical. 
Indeed, the virtue of the foregoing 
analyses of the concept of motiva- 
tion lies in their separation of these 
functions. The associative function 
of a motive or any other variable is 
identified in terms of its tendency to 
elicit or facilitate a limited class of 
responses only. The drive function 
of a variable is demonstrated if: (a) 
its presence energizes or intensifies 
indiscriminately all reaction tend- 
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encies existing in a given situation; 
and/or (6) its elimination or reduc- 
tion in magnitude is reinforcing, i.e., 
leads to the increased probability 
of recurrence, in the same situation, 
of the responses preceding its modifi- 
cation. Motives, it is presumed, have 
both functions, and if a variable 
clearly does not have both, its status 
as a motive is questionable. 

But, as Brown has recently pointed 
out, “‘an examination of contempo- 
rary discussions of motivation sug- 
gests that one of the major sources of 
misunderstanding is the failure of 
most writers to distinguish clearly 
between drives or motives, on the one 
hand, and habits or reaction tenden- 
cies on the other” (5, p. 2). Accord- 
ing to his view, all instances of di- 
rected behavior are attributable, not 
to drives or motives as such, but 
rather to the capacity of stimulus 
cues, including those accompanying 
drives states, to elicit particular re- 
actions.” 

Brown's restriction of the use of 
the terms ‘‘motive’’ and ‘‘motiva- 
tional’’ to refer to dynamogenic 
functions alone accords with Hull's 


* Postman has suggested that the notion of a 
nondirective drive may make the concept of 
drive expendable. Thus, he proposes that 
the explanation of variability in behavior, 
despite the constancy of stimulus situation and 
habit strength, might well be sought in the 
“changing patterns and intensities of drive 
stimuli” (37, p. 57). This sort of formulation, 
in point of fact, bears a close resemblance to 
that of Dollard and Miller (13), who define a 
drive as a strong stimulus. It does not, how- 
ever, as Postman further suggests, endanger 
the Hullian distinction between gH» and sEp, 
nor for that matter, the somewhat similar 
Tolmanian distinction between cognition and 
performance. Hull's D (as opposed to Sp) 
ts nondirective; a concept like drive would 
presumably still be needed to explain at least 
some kinds of reinforcement phenomena; 
and intensity of stimulation, in Hull's last 
formulations, is itself considered a motiva- 
tional variable multiplying habit strength 
(cf. 47, 48). 
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recent formulations, as discussed by 
Spence (47, 48), and is believed to 
be in the best ‘interests of termi- 
nological clarity. However, these 
terms will be used in the present 
paper in their more traditional (and 
ambiguous) meaning, to refer to 
states or processes having both direct- 
ing and dynamogenic components. 
The directive aspects of motives will 
be referred to variously as “‘associa- 
tive,” “‘cognitive,”” or “habitual,” 
whereas the term “drive” will be 
used to refer to the nondirective, 
energizing and reinforcing aspects, in 
accordance with Hull’s usage of the 
concept D. 

Since certain variables not usually 
classified as motivational, e.g., or- 
dinary environmental stimuli, may 
have strong associative connections 
even though they have little or no 
drive value, it is assumed that it is 
their drive function, basically, that 
distinguishes motivational variables 
from other kinds. Thus, the mere 
demonstration that a given variable 
has directive characteristics is not 
enough to qualify it as motivational. 
Furthermore, even if a_ variable 
qualifies as motivational because of 
its evident energizing or reinforcing 
effects under some circumstances, 
its effects upon behavior in other 
circumstances may depend primarily 
upon its associative properties, i.e., 
its tendency to elicit particular re- 
sponses. On the other hand, the fact 
that a variable has associative prop- 
erties does not preclude the possi- 
bility of its having drive properties 
as well. 

The implications of this general 
view are here considered in respect 
to the interpretation of the role of 
certain variables, commonly _re- 
garded as motivational, in verbal 
learning and performance. Although 
the discussion is somewhat arbitrar- 
ily limited to only a few such vari- 
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ables in relation to human verbal 
behavior, it is believed that these 
variables are not unrepresentative 
of motivational variables in general, 
and also, that the considerations 
raised concerning the analysis of their 
effects apply as well to nonverbal 
behavior, in both animals and men 
(ef. 15). 

Experimentally 


Manipulated (En- 


vironmentally Defined) Motives 


Electric shock. \n a large number 
of investigations of the effects of 
motivation, the state of S is more 
or less under the control of the in- 
vestigator. These ‘experimentally 
manipulated”’ states have, for the 
most part, been acquired. However, 
one primary motivational state whose 
effect upon verbal learning and per- 
formance has received considerabie 
attention is that induced by electric 
shock. 

To say that the effects of electric 
shock are not very well understood 
is an understatement. Nevertheless, 
an examination of studies involving 
its use is instructive, since it in- 
dicates many of the considerations 
that must attend analyses of so-called 
motivational variables in general. 
First of all, it seems clear that elec- 
tric shocks qualify as motivational by 
virtue of their drive-producing ef- 
fects. The role of electric shock as a 
reinforcer is well known, and its 
general energizing effect has been 
demonstrated in a number of studies 
(e.g., 1, 35, 42, 51). But these are 
by no means its only functions. For 
instance, electric shocks may be 
informative. Thus, under certain 
conditions, particularly when appro- 
priate verbal instructions are given, 
electric shock may lead to improved 
performance, whether administered 
for wrong responses or for right ones 
(28). It is probably gratuitous, and 
even misleading, in these instances, 
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to attribute the effects of the shock 
solely to its drive characteristics. 
Since shock has not only drive value, 
but also cue value, its associative or 
cognitive aspects must be taken into 
account. 

Even in those instances in which 
shock is noninformative, i.e., not 
related in any systematic way to 
correct or incorrect responses, it is 
necessary to consider its response- 
producing properties. The fact that 
shock tends to elicit withdrawal, or 
vocalization, or muscular contrac- 
tion in rats or human beings, may 
be a consequence, not of its drive 
characteristics, but rather, of its 
association with either learned or 
innate response tendencies. Since 
shock is an intense stimulus, capable 
of eliciting a variety of responses, 
it is not surprising that in certain 
situations it may elicit responses 
that are incompatible with those 
elicited by other stimulus compon- 
ents of the situation. Certain kinds 
of verbal performance are perhaps 
particularly susceptible to disrup- 
tion (interference) by shock, though 
there seem to have been no investiga- 
tions specifically designed to dis- 
cover whether verbal behavior is 
more susceptible in this respect than 
nonverbal behavior of various kinds, 
particularly when other factors, such 
as level of practice, are held constant. 

Although the disrupting effect of 
shock, when it occurs, is often attri- 
buted to the disorganizing influence 
of emotion, it seems unnecessary, at 
least in some situations, to attribute 
the interference to anything more 
than the introduction of an intense, 
extraneous stimulus which produces 
strong competing responses. This 
interpretation may also apply to 
other kinds of prepotent stimuli that 
have a distracting or disrupting 
effect, e.g., loud noises, loss of sup- 
port, or simply talking to S. 
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It must be noted that some kinds 
of verbal behavior, e.g., swearing, 
are likely to be elicited by shock or 
other noxious stimuli as a result of 
learning. If such responses are 
considered ‘‘correct’’ in verbal learn- 
ing situations, shock might be ex- 
pected to facilitate performance. 
Depending, then, upon the particular 
responses that have been acquired 
and the particular requirements of 
the test situation, shock might be 
either beneficial or detrimental. Un- 
fortunately, little is known concern- 
ing the relative strength of various 
kinds of verbal response to shock. It 
does not seem improbable, however, 
that at least some of the effects of 
shock are attributable to learned 
associations rather than to its drive 
value as such. 

Hunger. A somewhat more con- 
fident assessment of the role of stimu- 
lus cues may be made in relation to 
certain studies of the effects of the 
primary drive of hunger. Thus, there 
is evidence, not without its excep- 
tions, to be sure, that increasing 
hunger increases the probability of 
occurrence of verbal responses re- 
ferring to food. When this does occur, 
as both Brown (5) and Postman (36) 
have pointed out, it is most easily 
explained on the supposition that 
responses relating to food are more 
strongly associated with hunger stim- 
uli than are most other responses. 

This is not to say there is no evi- 
dence that hunger, defined in terms 
of food deprivation, functions as a 
drive. Studies showing that the 
receipt of food following deprivation 
is reinforcing, or that the presence 
of hunger has a generally enhancing 
effect upon response tendencies, at- 
test to its drive properties. But when 
the responses elicited or intensified 
in the presence of a drive are ‘‘need- 
related,”’ i.e., when they are responses 
that have been conditioned to the 
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cues characteristic of a given drive, 
their explanation obviously requires 
no appeal to drive qua drive. Studies 
showing that the frequency of verbal 
responses relating to food or eating 
in the presence of impoverished or 
ambiguous verbal or visual stimuli 
increases as a function of hunger are 
interesting and important. But they 
appear to deal, for the most part, 
with the associative rather than the 
drive properties of hunger. 

Fear (anxiety) resulting from noxi- 
ous stimulation. Many verbal learn- 
ing studies involving electric shock 
are designed to investigate its effects 
upon subsequent performance under 
nonshock conditions. It is not im- 
probable that behavior under these 
conditions is in part a function of 
purely perseverative effects of pain, 
or what a number of investigators 
(e.g., 1, 42, 43) have termed ‘‘emo- 
tionality." However, the motiva- 
tional state usually considered in 
this connection is that presumably 
aroused by environmental cues asso- 
ciated with the shock. The state 
thus aroused is an acquired one, 
namely, fear or anxiety. The demon- 
stration that fear, so defined, has 
nonassociative properties rests pri- 
marily, thus far, upon animal experi- 
ments indicating that the reduction 
of fear is reinforcing (31), or that its 
occurrence intensifies ongoing re- 
sponses that are not themselves 
produced by the fear (7). 

As in the case of hunger, many of 
the experimental studies of the 
effects of fear on human behavior 
involve the learning or retention of 
motive-related materials, i.e., partic- 
ular words or other items that had 
previously appeared in close tem- 
poral contiguity with a noxious 
stimulus. Asa result of conditioning, 
such items might be expected to 
elicit not only fear, but also some of 
the other responses previously elicited 
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by the shock. But in addition, as 
Dollard and Miller (13) point out, 
fear itself brings with it a num- 
ber of response tendencies, either 
learned or unlearned. (This holds 
true, of course, even though the fear 
is produced by stimuli other than 
those comprising the material to be 
learned or recalled.) 

The empirical evidence concern- 
ing the effects of fear on verbal per- 
formance has not altogether 
consistent. Under certain conditions 
the production of fear seems to facili- 
tate the recall or recognition of verbal 
materials (12, 34). More often, how- 
ever, it seems to elicit responses 
that successfully compete with verbal 
behavior. Considered in this manner, 
fear-produced responses might well 
be classed among those more prosaic 
variables usually considered as de- 
terminants of associative interference 
or facilitation. We have, 
very little information 
the kinds of response 


been 


as yet, 
concerning 
tendencies 


ordinarily produced by fear, or the 
way in which they interact with other 
response tendencies in specific situa- 
tions. There is reason to believe that 
fear may have particularly intense 
stimulus properties, and, other things 
being equal, may therefore elicit 


unusually strong responses, of a 
sort not ordinarily observed in labora- 
tory studies of forgetting (cf. 40). 
But this does not in itself necessitate 
an explanation of its effects in other 
than associative terms. Again, it 
must be emphasized that this is not 
to deny that fear acts as a drive. It 
does deny that disruption (or for that 
matter, facilitation) of behavior by 
fear is in itself an adequate basis 
for the supposition that it has drive 
properties (15). 

Anxiety aroused by socially tabooed 
materials. Somewhat related to in- 
vestigations using items associated 


with noxious stimulation are the 
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studies involving socially tabooed 
verbal materials. In these studies 
it is supposed that certain words or 
phrases arouse an acquired anxiety 
drive, not because of their associa- 
tion with punishment in the experi- 
mental situation, but because of pre- 
viously experienced social punish- 
ments, administered either for the 
overt usage of the verbal expression 
itself, for behavior labeled by the 
verbal expression, or for behavior 
associated with stimuli thus labeled. 
Anxiety aroused in this manner is 
sometimes regarded as more similar 
to the sort of anxiety considered 
by clinicians as a determinant of 
repression than that produced by 
electric shock (40). 

In many of the studies in which it 
has been found that the learning, 
retention, or recognition of socially 
tabooed or negatively valued items 
is impaired, the interpretation is 
often in terms of unconsciously 
motivated defense mechanisms. This 
sort of interpretation has been seri- 
ously challenged on two grounds: 
first, that the mechanism involved 
is typically far from unconscious; 
and secondly, that it primarily in- 
volves habitual rather than purely 
motivational mechanisms. 

The first objection stresses the 
possibility that the overt expression 
of socially unacceptable responses is 
at least sometimes deliberately and 
consciously inhibited (e.g., 21, 36, 
57). The second objection rests on 
the finding by Howes and Solomon 
(22) that the visual recognition 
thresholds for words vary with their 
relative frequency of usage, as de- 
termined by the Thorndike-Lorge 
word counts. By taking account of 
the large difference in the mean 
frequencies of usage of the taboo 
and neutral words used in an inves- 
tigation by McGinnies (29), Howes 
and Solomon (21) were easily able 
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to account for McGinnies’ finding 
that the taboo words had the high- 
er thresholds. Postman also has 
adopted the view that high thres- 
holds for negatively valued words 
are explicable in terms of their rela- 
tive habit strengths. Pointing out 
that the report of such words may 
be inhibited by the tendency to give 
more frequently used neutral words 
that have elements in common with 
the taboo words, he has concluded 
that 

...it is not necessary to ascribe selective 
functions to the immediate motivational con- 
ditions of the perceiver. Defensive and/or 
vigilant thresholds as well as systematic pre- 
recognition hypotheses may be consequences 
of the relative strengths of response disposi- 
tions aroused by specific stimuli (36, p. 80). 


This interpretation, it should be 
noted, is not inconsistent with the 
view that socially disapproved words 
may arouse anxiety, and that the 
anxiety acts as a drive. It is not 
improbable that the low frequency 
of usage of these words is itself 
due to their evocation of anxiety 
and to the reinforcement of their 
suppression resulting from anxiety 
reduction.’ 

Failure defined by verbal instruc- 
tions. One of the most widely used 
techniques for inducing so-called 
motivational changes in verbal learn- 
ing situations is the use of instruc- 
tions by E indicating that S has 
failed, As Lazarus, Deese, and Osler 


* This account is clearly neutral in respect 
to the question of the usefulness of the con- 
cept of “perceptual defense."’ It affirms the 
possibility that anxiety cues may produce 
responses that successfully compete with cor- 
rect recognition, and is altogether in accord 
with the view that there may be marked indi- 
vidual differences in the amount of anxiety 
aroused by various cues and/or the nature of 
the responses elicited by this anxiety. Further- 
more, following Dollard and Miller (13), it 
freely admits the possibility that the individ- 
uals concerned may sometimes be uncon- 
scious of the cues eliciting the anxiety. 
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(26) have observed, a distinction 
must be made between the purely 
motivational (drive ?) effects of fail- 
ure instructions and their informa- 
tive or distracting effects. Thus, the 
problems of distinguishing between 
the drive and the associative com- 
ponents of motivational variables 
must be considered in relation to 
failure experiences also. 

Experimental studies have usually 
tended to show that when failure 
instructions are not applied to specif- 
ic responses, but are used, instead, 
to characterize general performance, 
subsequent performance in _ verbal 
learning situations is impaired. As 
a result of such observations, it is 
generally believed that statements 
expressing approval of a learner's 
performance are more beneficial to 
his subsequent performance than 
statements indicating disapproval. 
At the same time, however, it is 
also rather generally believed that 
excessive praise may decrease motiva- 
tion and thus impair performance, 
whereas some kinds or degrees of 
censure may increase motivation and 
thus improve performance. This 
indecision with respect to the effects 
of failure characterizes treatments of 
the effects of frustrating events in 
general. For instance, Child and 
Waterhouse write: 

Indeed, the conflict appears strikingly in 
some genera] textbooks in psychology. In 
a chapter on thinking and reasoning frustra- 
tion is viewed as the condition for more organ- 
ized behavior, and in a chapter on emotion it 
is viewed as the condition for less organized 
behavior. The failure to use a common term 
such as frustration in the two chapters ap- 


parently permits the contradiction to go un- 
noticed (10, p. 127). 


In their subsequent discussion, 
these writers have presented a not- 
ably useful analysis of the manner 
in which the responses produced 
either directly or indirectly by the 
frustrating event might either im- 
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prove or impair performance. They 
suppose that prevention of the domi- 
nant response in the hierarchy may 
lead to its extinction, with the result 
that initially weaker responses are 
now evoked; or the frustrating event 
may modify the situation in such a 
way that some new response is 
elicited, either by some changed 
element of the situation or by the 
frustrating event itself. These pos- 
sible effects of frustrating events, it 
should be noted, relate exclusively 
to the modification of cues and/or to 
the relative strengths of habits asso- 
ciated with these cues. 

This emphasis by Child and Water- 
house upon the associative aspects of 
frustration extends also to their 
treatment of the possible drive-pro- 
ducing consequences of frustrating 
events. Thus, they suppose that 
“...frustration may operate to 
increase the motivation supporting 
the goal-oriented activity and there- 
by to improve the quality of per- 


formance’”’ (10, p. 136), or conversely, 
to produce a drive state of intense 
general emotion which impairs per- 


“ 


formance either ‘‘... because the 
emotional responses themselves are 
to some extent incompatible with the 
ongoing instrumental activity’ (10, 
p. 137), or because the emotion pro- 
duces other incompatible responses. 
It would appear, then, that the only 
properties explicitly attributed by 
these writers to the motivational 
states that might be produced by 
frustrating events are directive. 
They suggest, correctly, that the 
foregoing treatment has something in 
common with the ‘emotional’ in- 
terpretation of frustration behavior 
by Brown and Farber (6). But it 
seems, in general, to fall more clearly 
within the class of ‘‘nonemotional’’ 
interpretations, as discussed by the 
latter writers, since it deals with the 
possible behavioral consequences of 
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frustration without appealing to the 
concept of a frustration-produced 
drive having nonassociative proper- 
ties. As a matter of fact, until re- 
cently there has not been much 
evidence of a positive nature to sup- 
port the widespread notion that 
frustration produces (or acts as) a 
drive. Consequently, it has been 
doubtful whether, from the present 
point of view, frustrating events 
qualify as motivational variables 
(cf. 6). However, recent studies by 
Amsel and Roussel (2) and by 
Lambert and Solomon (25) indicate 
that at least some kinds of frustrating 
events affect animal behavior in the 
manner characteristic of a drive. 

Whether failure, defined in terms 
of verbal instructions, has a compar- 
able effect upon verbal behavior still 
remains to be demonstrated. Un- 
fortunately, under most conditions of 
verbal learning, there is no easy way 
to separate associative from possible 
drive effects of supposedly motiva- 
tional variables. Contrary to general 
belief, the impairment of performance 
by failure does not necessarily demon- 
strate its associative effects, nor 
does the improvement of perform- 
ance by failure demonstrate its 
drive effects. Just as the associative 
mechanisms attending failure may 
either benefit or harm performance, 
so might an increase in drive, in and 
of itself, affect behavior either favor- 
ably or adversely, depending on the 
specific nature of the task and the 
experimental conditions involved (16, 
48). Since few, if any, studies of the 
effects of failure have thus far been 
designed in a manner that permits an 
evaluation of its drive properties, the 
motivational status of this particular 
variable still seems somewhat doubt- 
ful. 


* Evidence concerning the reinforcing effects 
of escape from or avoidance of failure must not 
be neglected, of course, in considering this 
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Differentially Selected (Response-De- 
fined) Motives 


Logic of response-defined hypo- 
thetical constructs. The variables 
discussed so far have consisted of 
directly manipulable environmental 
conditions. Very often, however, 
motivational states are detined in 
terms of performance measures of 
one sort or another. In order to 
observe the effects of variations in 
motivation, thus defined, one must 
differentiate or select individuals who 
vary in terms of the particular re- 
sponse index chosen. If the response 
measure does reflect degree of drive, 
and if one has a theory concerning 
the way in which such variations in- 
fluence behavior, information con- 
cerning this response index for in- 
dividual Ss should enable one, in 
specific situations, to predict their 
behavior more accurately than one 
could do otherwise. 

As Spence has pointed out (45), the 
usefulness of response-inferred theo- 


retical constructs is in some ways 
limited, and, in principle at least, it 
should be possible, given sufficient 


information concerning antecedent 
conditions, both historical and con- 
temporaneous, to explain or predict 
the very behavior used as the index 
of the hypothetical state under con- 
sideration. However, when informa- 
tion concerning these antecedent con- 
ditions is meager or unavailable, the 
use of responses as indices of inter- 
vening variables is clearly advantage- 
ous. The importance of response-in- 
ferred constructs has been particu- 
larly stressed by those psychologists 
who insist that behavior cannot be 
predicted at all without ascertaining 





problem. Ir seems probable that failure or 
anticipation of failure can be profitably con- 
ceptualized in terms of a state resembling 
anxiety, which, when defined in certain ways, 
seems clearly to qualify as a motive (15). 
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the “meaning,” or S’s “perception” 
of a situation. In practice, they use 
certain responses, usually verbal, as 
indices of a “psychological environ- 
ment,"’ or ‘‘phenomenological field,” 
or some other hypothetical structure, 
to which they assign a primary ex- 
planatory role. From an S-R point 
of view there can be no quarrel with 
this procedure, provided that the re- 
sponses from which the hypothetical 
states are inferred are explicitly and 
unambiguously identified. However, 
as Bergmann observes, it is ‘‘. . . the 
business of science to ascertain which 
objective factors in the past and pres- 
ent states of the organism and its en- 
virortment account for the difference 
in response...” (4, p. 133), includ- 
ing, presumably, the response(s) from 
which one infers the “meaning’’ for 
S of any given situation. 

Of course, statements of relations 
between one kind of behavior and an- 
other, which Spence (45, 46) calls 
“R-R laws,’ need not involve any 
theory about motivation, or, for that 
matter, any other hypothetical state. 
But if a given response measure is 
labeled an index of motivation, and 
if this labeling is to serve any useful 
purpose, one’s theory concerning the 
relation of motivation to behavior 
ought to be made explicit. Other- 
wise, there is no justification for con- 
sidering the response in question to 
be a measure of motivation rather 
than anything else. 

It is instructive, from this point of 
view, to examine the many response 
measures that have been proposed as 
indices of motivational states. When 
this is done, one finds a very meager 
basis indeed for supposing that the 
majority reflect anything but asso- 
ciative tendencies. Although such 
findings do not in any way minimize 
the importance of these instruments 
or the empirical relations established 
in connection with them, they empha- 
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size the necessity for formulating ade- 
quate criteria of motivation. 
Allport-Vernon Study of Values. 
The problem of the motivational 
status of response-defined variables 
has received a good deal of attention 
recently in the specific case of per- 
formance on the Allport-Vernon 
Study of Values. As a result of the 
able analyses by Solomon and Howes 
(44) and Postman (36), it seems evi- 
dent that variations in scores on this 
particular instrument are much more 
reasonably interpreted in terms of 
variations in the habit strength of 
certain verbal responses than in 
terms of variations in drive factors 
per se. Studies by these investigators 
have demonstrated empirically that 
the relative frequency of usage of 
words in the various value areas com- 
prising the Allport-Vernon scale is 
the most important determinant of 
recognition thresholds for words as- 
sociated with these value areas.® But 


on theoretical grounds alone it could 
be concluded that a response index 
that does nothing but predict the 
relative strength of other specific re- 
sponses, and particularly those highly 
related to the items comprising the 
index, does not qualify as a measure 


of drive. There is no question that 
the Allport-Vernon Test measures 
something; but, from the present 
point of view, there is no reason to 
suppose that the “something” is a 
motive. 

Achievement imagery. Recently, 
McClelland, Atkinson, Clark, and 
Lowell (27) have published an im- 
portant analysis of motivation, cen- 
tered about their experimental in- 
vestigations and those of their col- 
leagues and students, of achievement 


5 This does not mean that variations in the 
habit strength of the verbal responses might 
not themselves be attributable to variations 
in drive. This possibility, however, would 
require independent verification. 
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imagery, as measured by the con- 
tents of stories given in connection 
with TAT-like pictures. The results 
of these investigations have indicated 
that Ss who vary in their achieve- 
ment imagery scores tend to differ, 
under certain conditions, in various 
other kinds of performance, including 
verbal. The predictive usefulness of 
this behavior index is clearly attested 
to by such findings, but it is not en- 
tirely certain that they demonstrate 
that the test reflects motivational 
characteristics. 

Thus, the obtained relations be- 
tween achievement imagery scores 
on the one hand, and visual recogni- 
tion thresholds for achievement-re- 
lated words or the frequency of usage 
in a written essay of certain terms 
(see 27, p. 252) on the other, seem to 
be precisely those that would be ex- 
pected on the assumption that varia- 
tions in achievement scores reflect 
variations in the strengths of certain 
verbal habits. In other words, the 
interpretation by Solomon = and 
Howes and by Postman of the be- 
havioral correlates of scores on the 
Allport-Vernon Scale may apply with 
equal cogency to these results also. 

The finding that increment of out- 
put with practice in a Scrambled 
Words Task was positively related to 
achievement scores is given particular 
weight by McClelland and his col- 
leagues in arguing that scores on their 
test are measures of motivation. 
They point out that, for some theo- 
rists, ‘“‘... the decisive criterion for 
determining whether a motive is in- 
volved in performance is whether or 
not it can produce learning. Since 
there is clear evidence of learning in 
the high m Achievement group, it can 
be argued that the n Achievement 
score is a measure of motivation” 
(27, p. 234). From the present point 
of view this reasoning is not very con- 
vincing. Although motivation is held 
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to be essential to learning, it does not 
follow from this assumption alone 
that, with other factors presumably 
held constant, better performance in 
learning situations is necessarily due 
to a higher drive level. Indeed, Hull's 
theory leads to the prediction that, 
under certain learning conditions, 
Ss with higher drive will perform 
more poorly than those with lower 
drive (16, 33, 38, 48, 54). 

The demonstration that the 
amount of food-related content in 
Ss’ stories tends to be positively re- 
lated to number of hours of food dep- 
rivation has also been used by the 
Wesleyan group as evidence of the 
influence of drive factors upon the 
kind of imaginative productions from 
which they have derived their n 
Achievement scores. Since hunger, 
defined in terms of number of hours 
of food deprivation, qualifies as a 
drive by virtue of the demonstration, 
in other studies, of its enhancing and 
reinforcing properties, it is possible 
that the amount of food-related con- 
tent in TAT stories reflects the drive 
aspect of hunger directly. But, as the 
discussion in an earlier section of this 
paper has indicated, an alternative 
interpretation in terms of the verbal 
habits associated with hunger is also 
clearly possible. 

In general, the authors of The 
Achievement Motive assume that re- 
sponses in fantasy situations are rela- 
tively unaffected by variations in 
habitual or cognitive factors: 

That is, for most subjects putting thoughts 
into words or verbalizing is a highly over- 
learned response [and is therefore a ‘‘normal”’ 
habit, whose strength is substantially similar 
in most individuals]. Furthermore, in the 
fantasy situation no particular set of responses 
is supposed to be perceived as especially ap- 
propriate. Fantasy is a “‘free’’ response situa- 
tion, provided the picture is not too struc- 
tured. It might not be for a certain class of 
persons, for professional writers, for example, 
because they may have learned a particular set 
of responses to use in such a situation... . 
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But except for professional authors, individuals 
should have no particular set of verbal re- 
sponse tendencies which seem appropriate 
because of past experience with such situa- 
tions (27, pp. 40-41). 


Although it is somewhat difficult 
to evaluate the merits of this argu- 
ment, since the meaning of the con- 
cept, “normal habit,” is not alto- 
gether clear, there seems to be plenty 
of evidence that stimulus factors and 
specific past experiences play at least 
some role in determining TAT per- 
formance (e.g., 11, 14, 39). The fac- 
tors determining the kind and 
amount of influence of associative 
variables upon performance in fan- 
tasy situations like that presented by 
the TAT are certainly not so well 
known as those determining the 
kinds of verbal performance tradi- 
tionally studied in the laboratory. At 
present, therefore, the supposition 
that variations in performance on so- 
called “projective” tests do not re- 
flect variations in associative factors, 
like the assumption that they do, 
appears to owe a greater debt to faith 
than to empirical fact. 

Iowa Picture Interpretation Test 
(IPIT) achievement scores. Despite 
the foregoing demurrals, the exten- 
siveness of the range of responses ap- 
parently affected by the achievement 
variable, as defined by the Wesleyan 
group, lends some credibility to the 
supposition that achievement image- 
ry scores may indeed reflect strength 
of drive. Among the studies of verbal 
behavior, the most important, in this 
connection, are those that demon- 
strate a relation between achieve- 
ment scores and other kinds of verbal 
behavior whose content has no obvi- 
ous relation to that from which the 
achievement scores are themselves 
derived. 

If achievement imagery scores are 
positively related to strength of 
drive, it could be predicted, on the 








MOTIVATION IN VERBAL LEARNING AND PERFORMANCE 


basis of Hull’s (23) assumptions con- 
cerning drive combination, that the 
introduction of an experimental vari- 
able calculated to increase drive 
would have less effect on the behav- 
ior of Ss with high achievement scores 
than it would have, in general, on the 
behavior of those with low achieve- 
ment scores (cf. 6 for a similar deduc- 
tion in regard to frustration). Stated 
somewhat differently, a correlation 
might be expected under ordinary 
conditions between achievement im- 
agery scores and performance. The 
addition of another drive variable, 
however, ought to reduce this correla- 
tion. 

To test this hypothesis, Hurley 
(24) recently administered the 1PIT 
to a large number of Ss. In this test, 
Ss are given a choice of four inter- 
pretations, or imaginative descrip- 
tions, of each of ten TAT pictures. 
One of the four interpretations of 
each picture involves achievement 
orientation, as judged by clinical 


psychologists on the basis of a defini- 


tion taken from The Achievement 
Motive. The other three interpreta- 
tions for each picture were designed, 
respectively, to indicate hostility, in- 
security, and blandness. 

When Ss were given instructions 
intended to be poorly motivating, a 
significant positive relation was found 
between achievement scores and the 
number of correct anticipations in a 
serial rote learning task. Under high 
motivational instructions, however, 
the correlation was insignificant.’ 
There was also some tendency for 
achievement scores to be positively 


* This was made possible by the generous 
cooperation of Dr. J. W. Atkinson, who 
furnished a prepublication copy of the manu- 
script. 

7 The difference between the two correla- 
tions was significant at p<.07. The higher 
level of significance reported for this com- 
parison in the original version of this paper was 
based upon a smaller N. 
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correlated with number of erroneous 


anticipations under the low motiva- 
tional instructions. This result, 
though not statistically significant, 
was similar to a finding reported by 
Lowell (see 27). 

Although Hurley's results were in 
accord with predictions made on the 
assumption that achievement scores 
reflect degree of motivation, they are 
also consistent with the notion that 
achievement scores reflect a general 
response tendency influencing per- 
formance in a wide variety of situa- 
tions. Ss with high achievement 
scores may simply have the habit of 
being or at least appearing ‘‘eager.”’ 
In other words, they may have 
learned certain work habits of an “‘ef- 
fortful”’ or “active” sort. The distinc- 
tion between drive-produced activity 
and learned activity has been nicely 
brought out by Miller (32), who 
points out that rats can be /rained to 
be extremely active when hungry if 
pellets of food are widely scattered 
about a maze. Animals trained in this 
way would presumably be more ac- 
tive (‘‘eager-appearing’’) than un- 
trained animals, even though the 
latter might actually be hungrier. In 
like manner, human Ss appear to 
vary, as a result of learning, in the 
degree of obviousness of their desire 
to succeed. Under low motivational 
instructions Ss having relatively 
strong habits of this sort should tend 
to produce both more correct and in- 
correct responses. When the instruc- 
tions themselves produce the ‘“‘eager- 
ness’’ response, the differences be- 
tween the two groups might be ex- 
pected to disappear. It is far from 
certain, then, that the term “achieve- 
ment motive’’ is appropriate, at least 
when applied to the indices here con- 
sidered; “achievement-oriented hab- 
it,’’ though more awkward, may be 
more accurate. 


IPIT hostility scores. Certain of 
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Hurley's results suggest that the per- 
sonality (response-defined) variable 
operating in his study may actually 
have been hostility, rather than 
achievement-orientation. He found a 
highly significant negative correla- 
tion between the hostility scores on 
the IPIT and the number of both cor- 
rect and erroneous anticipations in 
verbal learning, under low motiva- 
tional instructions. This correlation 
disappeared under the highly moti- 
vating instructions, the difference be- 
tween the two correlations being sig- 
nificant. Since the correlation be- 
tween scores on the achievement and 
hostility indices of the [PIT was high 
(r= —.71), the relation of each score 
to performance was evaluated by us- 
ing partial correlations. When the 


effect of the hostility score was elimi- 
nated, the partial correlations be- 
tween achievement and performance 
were reduced to insignificance. How- 
ever, when the effect of achievement 
imagery scores was partialled out, 


the negative correlation between hos- 
tility and number of correct and total 
(correct plus incorrect) anticipations 
under the low motivational condi- 
tions remained significant. 

It is possible that high hostility, as 
measured by the IPIT, itself reflects 
a drive state. Since high D may, un- 
der certain conditions, be expected 
to produce poorer verbal perform- 
ance (see below), such an assumption 
would not be inconsistent with Hur- 
ley's results. But this would imply 
that high achievement scores re- 
flect low D. It seems simpler and 
more plausible, as Hurley himself 
concluded, to attribute the relations 
between the hostility scores and per- 
formance to differences in habit tend- 
encies rather than to drive. 

Taylor Anxiety Scale (A-scale) 
scores. A number of investigators in 
the lowa Laboratory and elsewhere 
have been expressly concerned with 
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the experimental and _ theoretical 
problems involved in testing the sup- 
position that a given response meas- 
ure reflects drive. Using an anxiety 
questionnaire originally constructed 
and recently revised by Taylor (52, 
53), these investigators reasoned as 
follows: If Ss’ scores on this test re- 
flect general drive level, and if varia- 
tions in drive affect response strength 
in the manner suggested by Hull (23), 
then the performance of these Ss 
should differ in certain predictable 
ways in given experimental situa- 
tions. 

Since the studies were for the most 
part deliberately restricted to a con- 
sideration of nonassociative charac- 
teristics on the view that this pro- 
cedure is most appropriate ina test 
for motivational properties, the as- 
sumption figuring most prominently 
in these predictions was that all habit 
tendencies elicited in a given situa- 
tion are multiplied by the value of 
the total effective drive then operat- 
ing. On this assumption, it can be 
shown that in learning situations in 
which correct responses are highly 
dominant, Ss with a relatively high 
drive level should perform better 
than those having a low drive level.* 

If, however, erroneous response 
tendencies are stronger than the cor- 
rect ones, the performance of Ss with 
high drive should be poorer than that 
of Ss with low drive. Furthermore, 


* Contrary to the supposition by Lazarus, 
Deese, and Osler (26), Taylor's prediction that 
anxious Ss would give more conditioned eyelid 
responses than nonanxious Ss was based upon 
this multiplicative assumption (R=H XD), 
rather than upon the assumption that anxious 
Ss have the stronger associative tendency to 
make defensive reactions. In the case of sim- 
ple eyelid conditioning, it is not possible, of 
course, to decide between these two inter- 
pretations of Taylor’s original results (52). 
In other situations, however, the behavior 
predicted on the basis of these different 
hypotheses would presumably be quite 
different. 
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as Taylor and Spence (54) have 
pointed out, in complex situations, 
even though a correct response tend- 
ency might be slightly stronger than 
any alternative superthreshold tend- 
encies, an increase in drive could nev- 
ertheless increase the frequency of 
erroneous responses. This does not 
mean that the presence of competing 
tendencies would always put Ss with 
high drive at a relative disadvan.age, 
regardless of the strength of the cor- 
rect tendencies, but, rather, that the 
relative strengths of the correct and 
incorrect tendencies are not the only 
factors to be considered. Thus, ac- 
cording to Hull's theory, an increase 
in drive (D) will increase the excita- 
tory strength (£) of the strongest re- 
sponse more than that of the weaker 
ones, but will also increase the prob- 
ability that more of the weaker re- 
sponses will have superthreshold val- 
ues. If the increase in D does not in- 


crease the E of the strongest tendency 


so much that its downward oscilla- 
tion never brings it within the range 
of E’s for the weaker responses, the 
possibility of successful competition 
by weaker, but now superthreshold 
tendencies, may be increased. Clear- 
ly, there is no possibility of successful 
competition by the weaker tendencies 
if their strengths are subthreshold. 

A number of deductions that could 
be made on the basis of the foregoing 
theoretical analysis have been tested 
in various situations, including ver- 
bal learning (33, 38, 48). The results 
have indicated that the quality of 
performance in complex learning situ- 
ations is inversely related to Ss’ de- 
gree of anxiety, as measured by the 
Taylor scale, and furthermore, that 
the advantage of the nonanxious 
over the anxious Ss is positively re- 
lated to the probable number and 
strength of the competing responses 
elicited. At the moment of this writ- 
ing it does not seem altogether cer- 
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tain whether conventional verbal 
learning tasks can be made simple 
enough to provide a consistent ad- 
vantage for the anxious Ss. For in- 
stance, Montague’s (33) anxious Ss 
were superior to the nonanxious on an 
easy list, consisting of items having 
high association values and a mini- 
mum of intralist similarity, but this 
simple effect was not significant; only 
the interaction between anxiety and 
difficulty of list was significant. How- 
ever, more recent studies (see 48) in- 
dicate that it is possible to construct 
verbal learning situations that permit 
highly anxious Ss to reap the benefit 
to which their higher drive, if it in- 
deed exists, should entitle them un- 
der appropriate conditions. 

Theoretical implications of A-scale 
studies. It has not always been recog- 
nized that most of the Iowa studies 
of the relations between A-scale 
scores and various kinds of labora- 
tory performance have been exclu- 
sively concerned with the test of the 
possibility that the A scale measures 
drive level (D). From the present 
point of view, the test of the supposi- 
tion that the A scale or any other 
variable is a measure of D, or of mo- 
tivation, generally, must be made in 
terms of predictions based on purely 
nonassociative assumptions, a pro- 
cedure seldom if ever followed in the 
case of other response indices. It 
must be noted that this kind of test 
can be made only under certain con- 
ditions, namely, those in which the 
associative properties of the variable 
may be presumed to have little dif- 
ferential effect. Under such neces- 
sarily limited conditions, the general 
success of the predictions thus far 
may fairly be said to lend increased 
confidence in a drive interpretation 
of A-scale scores. 

This conclusion does not imply 
that A-scale scores reflect nothing but 
D. On the contrary, if they reflect a 











324 


motivational state, then in the pres- 
ent view, they also undoubtedly re- 
flect associative tendencies. Under 
many circumstances, therefore, Ss 
with relatively high scores should per- 
form differently from those with low 
scores by virtue of differences in 
their habits. In such instances one 
could not expect to account for varia- 
tions in the behavior of anxious and 
nonanxious individuals in terms of 
drive level alone.* i 

The theoretical studies concerned 
with the A scale were never intended, 
then, to investigate all of its possible 
functional relations. Nor, contrary 
to the belief of some, was the A 
scale constructed in order to predict 
as well as possible either eyelid con- 
ditioning, or verbal learning, or 
motor skill, or any other sort of per- 
formance. If this had been so, a very 
different procedure would un- 
doubtedly have been followed. If one 
wished to use a self-rating question- 
naire, one would administer a very 
large number of items, obtain the cor- 
relation of each with the behavior to 
be predicted, and then, after cross 
validation and a consideration of the 


* From the standpoint of these considera- 
tions, Child’s (9) criticisms of the Iowa stud- 
ies for failing to investigate the associative 
aspects of manifest anxiety do not seem to 
have been well-taken. Although the motiva- 
tional status of anxiety could not be demon- 
strated by pointing to differences in the habits 
of anxious and nonanxious Ss, there is no 
doubt that such differences exist. On the 
other hand, despite considerable speculation 
and some empirical evidence (e.g., 3, 18, 19, 
55, 56), it is not as yet entirely certain what 
these habitual differences may be, apart from 
the trivial observation that they consist, at 
least in part, in the kinds of verbal responses 
given on the test itself. For instance, despite 
some supporting evidence (20), it is doubtful, 
on either theoretical or empirical grounds, 
(49, 50) that a consideration of the specific 
nature of anxiety responses warrants the be- 
lief that anxious Ss are (habitually ?) poorer 
than nonanxious Ss in discriminating between 
threatening and nonthreatening stimuli. 
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item intércorrelations, select the set 
that most efficiently predicted the 
criterial measure. A test constructed 
in this manner would unquestionably 
predict the specific kinds of behavior 
it was designed to measure better 
than the A scale could be expected 
to do. 

It is certainly not surprising that 
various kinds of empirically selected 
measures excel the A scale as pre- 
dictors of certain kinds of behavior. 
But since the A scale was not devised, 
nor ever considered by those asso- 
ciated with its construction, to be a 
measure of all of the variables in- 
fluencing performance, some sort of 
limit is to be expected, under most 
conditions, in the amount of variance 
in performance that is associated with 
scores on the A scale. It is only in 
certain kinds of controlled situations 
that variations in performance are 
likely to be primarily due to varia- 
tions in drive. 

A misconception somewhat related 
to the notion that the A scale was in- 
tended to be a “best predictor” of 
some kind is that the ‘essential’ 
validity of the Taylor A scale de- 
pends upon its correlation with other 
indices of anxiety, e.g., those of a 
clinical or psychiatric nature. There 
is some evidence, as a matter of fact, 
that scores on this test do relate to 
clinically diagnostic measures, in- 
cluding psychiatric ratings (17). Such 
relations, when they are found, are 
interesting and useful (though per- 
haps not so useful as one might wish, 
in view of the notorious unreliability 
of psychiatric judgments), but they 
most emphatically do not constitute 
a validation of the test in respect to 
the purpose it was originally designed 
to serve. As Bechtoldt (3) has re- 
cently observed, the question of the 
validity of the Taylor scale as a useful 
definition of general drive level is an- 
swered by the accuracy of the predic- 
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tion of relations between this scale 
and specified behavior variables, un- 
der conditions such that variation in 
the behavioral variables can be rea- 
sonably attributed to differential 
drive levels. 

In view of this, the Taylor ques- 
tionnaire might better have been 
termed a test of “excitability,”’ or 
“level of responsiveness,” or simply, 
in accordance with the present usage, 
an “A scale.” Nevertheless, the 
items of the scale were selected by 
clinical psychologists in accordance 
with the specifications for manifest 
anxiety laid down by a descriptive 
psychiatrist of high repute (8). It 
does not seem inappropriate, con- 
sequently, to consider this test an in- 
dex of manifest anxiety as well as D. 


SUMMARY 


It is the major thesis of the present 
paper that the question as to whether 
a given variable acts as a motive or 
reflects a motivational state can be 
answered only in the context of ex- 
plicit assumptions concerning the 
particular ways in which motiva- 
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tional variables can be differentiated 
from other variables in their influence 
on behavior. In view of the current 
confusions between the concepts of 
motivation and habit, it would ap- 
pear to be particularly desirable to 
distinguish between the associative 
and nonassociative properties of vari- 
ables. Thus, experimentally manipu- 
lated or environmentally defined var- 
iables such as electric shock, hunger, 
fear, and failure may be classed as 
motivational by virtue of their non- 
associative (drive) properties; never- 
theless, their effects upon verbal 
learning and performance in some 
situations are best understood in 
terms of their associative character- 
istics. With a very few exceptions, 
the influences of differentially se- 
lected or response-defined variables 
frequently classified as motivational 
also seem attributable, for the most 
part, to their associative properties; 
in the absence of convincing evidence 
that such variables have anything 
other than an associative influence, it 
is doubtful whether they qualify as 
motivational at all. 
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Approximately 22 years of the av- 
erage man’s life is lost in sleeping. 
Economically minded persons and 
harassed students have long searched 
for some means to use this time to 
further advantage. For nearly a 
third of a century, now, there has 
been a growing interest in the possi- 
bility of trying to learn while one 
sleeps. 

The science fiction writers were 
among the first to propose sleep- 
learning as an educational technique. 
In 1911, a magazine called Modern 
Electrics published a fiction story by 
Hugo Gernsback (4) in which the 
hero learned during sleep. The proc- 
ess was simple. Upon retiring, the 


student placed the material to be 
learned on a machine which played it 
automatically while he slept. In 1932, 


Aldous Huxley (9), in his Brave New 
World, scoffed at the possibility of 
improving the S’s intellect while he 
slept, but described the world of the 
future as one which utilized the sleep 
period to train the lifelong attitudes 
of its populations. Other science fic- 
tion writers have ,continued to use 
the idea of sleep-training in their 
stories. 

Popular news and picture maga- 
zines, citing experimental evidence, 
constantly reinforce the public's in- 
terest and misinformation about this 
osmotic form of education. From 
time to time, a national news service 
will release a story telling of some 
individual who works for a living by 
day and studies chemistry, operatic 
arias, or other lessons while sleeping 
at night. On ‘People Are Funny," a 
popular radio stunt show, an audience 
participant learned a few short sen- 
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tences and a girl’s name while he was 
apparently asleep on the stage be- 
fore the audience. 

Following closely on the heels of 
public interest have come the entre- 
preneurs. Commercial firms have 
sprung up throughout the country 
selling recording devices which auto- 
matically turn on and off to permit 
the individual to learn while he sleeps. 
Impressive publications have been 
distributed by many of these firms 
propagandizing their products and 
extolling the validity and value of 
sleep-training (19, 21). On the whole, 
their claims of success for sleep-train- 
ing have been extrapolations beyond 
the available supporting data. The 
majority of claims have been based 
on distorted facts, statements by un- 
qualified authorities, and armchair 
hypotheses. Noncritical use of the in- 
formation, anecdotal evidence, and 
the citing of inadequate research, 
much of which will be reviewed in 
the present paper, have made these 
commercial publications poor cri- 
teria for the support of the validity of 
the sleep-learning process. 

In its constant search for new 
methods to speed training during na- 
tional emergencies, the military, too, 
has tried to sleep-train its personnel. 
In both the first and second World 
War, exploratory efforts were made 
to teach service men Morse code'? 
during their sleep. A VA Hospital 
doctor reported the successful use of 
sleep-training as a supplement to 
waking-training in a mental health 


'Comdr. R. R. Humes, USN. 
communication. January 25, 1954. 

2. L. Thurstone. Personal communica- 
tion. October 19, 1953. 


Personal 
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program (20). Unfortunately, most 
of these incidents have been too in- 
adequately controlled to judge their 
effectiveness during sleep, and in gen- 
eral their results have not been suf- 
ficently striking to justify the extra 
energy and expense required to carry 
them out. 

It is the scientist who has been 
slow in investigating these claims for 
successful sleep-learning. Although 
psychologists have been concerned 
with problems of learning from their 
formalized beginning, there has been 
relatively little interest in the psy- 
chological processes which take place 
during sleep and a rather skeptical 
although untested—attitude toward 
the value of studying sleep-learning. 
One of Pavlov’s students, Krasno- 
gorski (11), did attempt without suc- 
cess to condition the salivary reflex in 
a young child during sleep. Recently, 
however, a resurgence ol interest in 
the problem has occurred. in the 
present paper the available research 


on the sleep-learning phenomenon 
has been collected and subjected to a 
critical evaluation. 

Ten such studies were found. Of 
these only three have been published 
(3, 12, 13). Five were academic theses 


at the master’s or bachelor’s level 
(1, 2, 5, 8, 23) of which one was read 
at an APA meeting (23). The remain- 
ing two?’ were described in private 
communications from the authors. 
Although a number of other refer- 
ences were found, insufficient details 
were available for an adequate review 
or else they could not be classified as 
research under even the broadest defi- 
nition. 

The criticisms in this paper are not 
for the purpose of belittling these 
pioneer studies. These criticisms are 
believed necessary since these stud- 


*L. LeShan. 
October 27, 1953. 


Personal communication 
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ies —many of which are unpublished 

are being cited and their conclu- 
sions accepted both by the general 
public and some scientists. Because 
of the unusual methodological prob- 
lems in this area of research, a review 
may help future workers to avoid 
similar mistakes. 


Description of the Sleep-Learning 
Studies 


The experiments reviewed in this 
paper are described briefly below. In 
some of the studies, Ss were elimi- 
nated for various reasons. Only the 
number of Ss actually completing the 
experiment is reported. 


Thurstone,? in 1916, supplemented the 
waking training of 16 Navy men in a 
Morse code course with material pre- 
sented during sleep. The criteria of sleep 
used in this study are unknown. The Ss 
finished the course three weeks earlier 
than had been expected. Thurstone con- 
cluded that the results “indicated some 
gain’’ for the sleep-trained group in their 
ability to send and receive Morse code. 

LeShan (12), in 1942, tried to break the 
fingernail-biting habit of 20 boy campers, 
ages 8 to 14, by playing the phrase, “My 
fingernails taste terribly bitter,’’ through 
a loudspeaker 300 times a night for 54 
nights. The Ss were asked if they were 
awake before turning on the input; if the 
Ss appeared restless, the input was turned 
off. The nails of the Ss in the experi- 
mental group and an equal number of Ss 
in a control group were checked ever) 
two weeks for eight weeks for signs that 
the nail-biting had stopped. Since 40 per 
cent of the experimental group and none 
of the control group stopped biting their 
nails, LeShan felt this indicated “the 
possible therapeutic use of suggestion 
during sleep.”’ 

LeShan* in 1943, taught a single S a 
different list of nonsense syllables each 
morning for 12 days. On the fifth and 
eighth nights, the list to be learned the 
next day was repeated 50 times while S 
slept. The criteria of sleep used in this 
study are unknown. Fifty per cent fewer 
trials were required to learn the two 
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sleep-trained lists than to learn the ten 
nonsleep-trained lists. LeShan_ con- 
cluded that the sleep-training facilitated 
learning. 

Elliott (2), in 1947, studied the effects 
of 30 repetitions of a list of 15 common 
three-letter words during sleep on the 
per cent saved in learning the list by the 
anticipation method the following day. 
Two groups of 20 male college students 
of equal learning ability spent one night 
adapting to the laboratory environment. 
The word list was repeated serially to 
one group of Ss while they slept. An 
EEG record was used as a criterion of 
sleep. No material was played when the 
S showed “clear alpha patterns’’ al- 
though the EEG was turned off when 
E believed S would stay asleep. On 
awakening the next morning, the two 
groups learned the sleep-trained list by 
the anticipation method and were asked 
if they awoke during the night. The 
group receiving the sleep-training showed 
a significantly greater percentage of sav- 
ings than the control group (p=.05).‘ 
No significant differences between groups 
were found on the basis of errors or abso- 
lute number of trials to learn. Elliot con- 
cluded that there is some retention of 
auditory material during sleep. 

Hedges (5), in 1950, tried to improve 
the speech of three mentally retarded 
and aphasic children (ages 11, 74, and 6) 
by sleep-learning. Short sentences, 
paragraphs, and simple consonant sounds 
were played to the children between 433 
and 1501 times distributed over 7 to 13 
nights. The machine was not turned off 
if an S awoke. The third child required 
fewer waking lessons to learn the sleep- 
trained consonant than it had required 
previously to learn a nonsleep-trained 
consonant. The babbling of the first 
child increased. The second child showed 
no effects. Hedges interpreted this as 
showing “the possibility of perhaps 
speeding’’ the training of the third S. 

Fox and Robbins (3), in 1952, taught 
30 college males and females a list of 25 


‘The p values representing the results of 
tests of significance are all reported as if they 
were based on a two-tailed ¢ test, irrespective 
of what the author used. The p levels greater 
than .15 were not reported. 


ans! 


CHARLES W. SIMON AND WILLIAM H. EMMONS 


pairs of English-Chinese vocabulary. 
The Ss were divided into three groups, 
matched on a pretest. Two vocabulary 
lists were prepared; in the first list, word 
pairs were matched as they were to be 
learned on the posttest (the facilitation 
list), while in the second, they were mis- 
matched (the interference list). These 
lists were played 15 times to two of the S 
groups approximately three hours after 
they had retired to go to sleep. The third 
group—a control—heard only music 
while they slept. Although Es did not ob- 
serve Ss during the input period, they 
disqualified all Ss who said they had 
awakened during the night. The facilita- 
tion group required fewer trials and the 
interference group required more trials 
to learn the posttraining list than did the 
control (p <.001). The Es concluded that 
learning could occur during sleep and 
could be detected by the savings method. 

Leuba and Bateman (13), in 1952, be- 
lieved that they had taught the words of 
three songs, 8 to 27 lines long, to a lightly 
sleeping S. Each song, unknown to S be- 
forehand, was played to her five times a 
night for three successive nights as she 
slept. During the input period, S was 
not observed by E, although for three 
brief periods, S’s husband observed no 
restlessness on the part of the S. The S 
claimed she did not awaken during the 
night. Given only the song title the next 
day, S was able to write the lyrics of two 
out of the three songs with no errors and 
of one with only minor errors. The Es 
believed the sleep-training was success- 
ful. No learning occurred in a later study 
during and following the use of sedatives. 

Hoyt (8), in 1953, taught ten pairs of 
English-Chinese vocabulary to his Ss. 
Twenty Ss were matched on a pretest 
and given one night to acclimate to 
sleeping in the laboratory. On the ex- 
perimental night, one group of eight Ss 
received twelve repetitions of a list paired 
in the same manner as that to be learned 
the next morning (facilitation list). A 
second group of eight Ss received a list of 
comparable words, but with the pairs 
mismatched (interference list). The re- 


maining four Ss acted as a control group 
and received no sleep-training. During 
the input period, Ss were observed by E. 
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If they awoke or heard the stimulus ma- 
terial, they were to turn on a light. [If 
this occurred, or if they stirred, the re- 
cording was turned off until S lay quiet 
for a suitable length of time. The next 
morning Ss were asked if they had 
awakened during the night. Those that 
had not were given the paired vocabulary 
list to learn to two correct anticipations. 
Statistically insignificant differences were 
found to favor greater savings for the 
interference group rather than the facili- 
tation group in both the mean number of 
trials required to learn the list as well as 
the number of correct responses occurring 
before the criterion was reached. No 
comparisons were made with the control 
group. Hoyt felt that under the condi 
tions of this study, learning during sleep 
could not be detected. 

Stampfl (23), in 1953, had six college 
males learn lists of ten nonsense syllables 
while acting as their own controls. Dif- 


ferent lists were repeated 4, 8, 16, and 32 
times on different nights while Ss slept, 
and were learned to one correct anticipa- 
tion on the morning following the repeti 
tion of a particular list. At other times, 


the same Ss were tested on other lists 
after no sleep-training. Before presenting 
the stimulus material, Ss were asked if 
they were awake. During the input 
period they were watched, and if move- 
ments occurred, the input was stopped. 
Since no significant differences were 
found for savings in either trials or errors 
between learning sleep-trained and non- 
sleep-trained lists, no comparisons were 
made among performances after different 
numbers of training repetitions. Stampfl 
felt that the sleep-learning hypothesis 
was uncertain and improbable. 

Coyne (1), in 1953, carried out a series 
of exploratory studies on a variety of 
psychological problems using from four 
to six male college students as Ss. He 
believed the results to be generally favor- 
able for sleep-learning. Unfortunately, a 
statistical error was discovered in 
Coyne’s thesis after he had written his 
discussion and conclusions which led to 
a more favorable interpretation of the 
results than was justified. Although EF 
could observe S as he slept, he primarily 
depended on S to press a buzzer as soon 
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as he awakened during the night. The 
problems Coyne studied are described 
below. 

1. An interference list of adjectives 
was presented 25 times to the sleeping S. 
A similar list was learned to one perfect 
anticipation the next merning. A con- 
trol group had no sleep-training. The 
sleep-trained group did poorer than the 
control group (p=.07). 

2. When the first problem was re- 
peated using S as his own control, more 
errors were made on the list which had 
received the interfering sleep-training 
(p=.15). No difference in savings (trials 
to learn the posttraining list) was ob- 
served. 

3. Twenty pairs of numbers and words 
were presented 24 times during a single 
45-minute period to the sleeping Ss. 
These same Ss also received another list 
for the same number of repetitions dis- 
tributed over a four-hour period. On 
subsequent mornings, Ss were required 
to answer the appropriate word when 
given the number found on the list on 
which they were trained during the night, 
after that list had been mixed with ten 
additional new number-word pairs. A 
greater percentage of sleep-trained words 
were associated with the correct numbers 
than those words not trained during 
sleep (p=.15). Distributed sleep-learning 
resulted in fewer recall errors than did 
massed sleep-learning (p=.05). 

4. While the Ss slept, one list of ad- 
jectives was repeated 100 times; on a 
different night, another list was repeated 
25 times. On the morning following the 
sleep-training, the list was learned to one 
perfect repetition. No significant differ- 
ences were found in performance between 
the two amounts of training. 

5. The Ss were required to solve a 
number of problems by finding E’s solu- 
tion of a concept composed of the correct 
combination of letters, colors, and their 
relative positions. The solution to one 
problem had been repeated 180 times 
while S slept the night before. Insignifi- 
cant differences were found favoring the 
performance on the sleep-trained prob- 
lem. 

6. While Ss slept, one list of nonsense 
syllables was presented 30 times begin- 





332 


ning at two o'clock and, on a different 
night, another list was presented 30 
times beginning at five o'clock. Since it 
was discovered that the two lists were of 
unequal difficulty, comparisons between 
performance at different times of pres- 
entation were not made. 

7. During sleep, the contents of a 
particular picture were described to Ss 
90 times. The following morning, a num- 
ber of out-of-focus pictures were shown, 
including the one described during sleep. 
The degree of focusing required before S 
could identify the pictures was the meas- 
ure of learning. No significant differences 
in the case of identifying sleep-trained 
and nonsleep-trained pictures were found 
either when Ss were asked to give un- 
aided responses or when multiple-choice 
solutions were provided. 

Rather than examine each of the 
ten sleep-learning studies independ- 
ently, this paper has been organized 
to discuss them all on the basis of the 
following categories: experimental 
design, statistical considerations, 
methodological considerations, and 
sleep criteria. 


EXPERIMENTAL DESIGN 


The use of a control group or of 
using S as his own control is recog- 
nized as a necessary procedure in or- 
der to know whether a certain experi- 
mental effect is real or not. In a num- 
ber of the sleep-learning studies, how- 
ever, both of these techniques were 
conspicuously absent or inadequately 
handled. 

Thurstone (see footnote 2) recog- 
nized the inadequacy of his uncon- 
trolled experiment teaching Navy men 
Morse code and attempted to run a 
second study in order to compare the 
performance of one group which re- 
ceived sleep-training with another 
which did not. This study was dis- 
continued, he reported, when it was 
discovered that ambitious Navy in- 
structors of the control group had 
been giving extra daytime instruction 
in order not to be out-taught. 


CHARLES W. SIMON AND WILLIAM H. EMMONS 


Hedges (5) used no controls with 
two of the three speech-defective 
children he attempted to sleep-train. 
Since improvement in speech is a 
maturational problem, even for re- 
tarded children, the need for a con- 
trol was paramount. The increased 
babbling of the first child may have 
been independent of the sleep-train- 
ing and due solely to an additional 
month or two of growth, although 
Hedges believed that it took place 
immediately after the introduction of 
the sleep-training. Since S also re- 
ceived waking-training on the same 
material, it is impossible to know to 
what extent one can attribute the in- 
crease in babbling to sleep-training. 
No apparent learning took place with 
the second S. Although Hedges’ 
third S acted as his own control by 
learning to pronounce one consonant 
with sleep-training added to his wak- 
ing-training and another consonant 
without the sleep-training, this was 
an inadequate control measure since 
the experimental design was such 
that the sleep-trained consonant fol- 
lowed the nonsleep-trained conson- 
ant. Any improvement in the latter 
could be attributed to practice as well 
as to maturation. 

Nor was a control used in the work 
of Leuba and Bateman (13). In this 
study, S presumably had no previous 
knowledge of the songs played to her 
during sleep, yet was able to write 
the lyrics without further training on 
awakening. If this were the case, any 
learning which took place during 
sleep would be a significant improve- 
ment, although materials such as 
songs and poetry probably have some 
internal predictability. This form of 
experimental design represents an 
implied control. It is implied that S 


acted as her own control for had she 
been tested previously, no appropri- 
ate responses could have been made, 
nor could one suspect that matura- 
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tional factors were operating to pro- 
duce positive results. 

A number of Es actually used their 
Ss as their own control. In LeShan’s 
(footnote 3) second study, he com- 
pared the number of trials an S re- 
quired to learn a list with and with- 
out sleep-training over a period of 
days. By having the nontrained peri- 
ods before and after the sleep- 
trained periods, the superiority of the 
sleep-trained lists could not be at- 
tributed simply to practice or matu- 
ration. On many of his studies, 
Coyne (1) gave sleep-training to his 
Ss one night and no sleep-training on 
the next, counterbalancing the order 
for these procedures between two 
groups of Ss. However, when the S 
acts as his own control, either the 
study material which is used under 
the varying conditions must be care- 
fully equated, or additional counter- 
balancing must be introduced into 
the experimental design to correct 
No E used this 
some made 


for the inequalities. 
counterbalanced design; 
use of the published tables on which 
similarities and associability of the 
stimulus material had been previ- 


ously calculated for the items. Sev- 
eral of Coyne’s (1) studies were un- 
analyzable after he discovered the 
study material had not been equated. 

Four of the experiments were de- 
signed to use separate control groups. 
In Elliott’s (2), Fox and Robbins’ 
(3), and Hoyt’s (8) studies pretesting 
took place in order to equate the 
mean performance of the control and 
experimental groups. LeShan (12) 
used unmatched groups in his study 
with fingernail-biting children. The 
median age of his twenty experi- 
mental nailbiters was slightly less 
than ten years. His control was di- 
vided into two groups consisting of 
eight nailbiters with a median age of 
nine years, and twelve more nail- 
biters with a median age of twelve. 
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Some question might be raised con- 
cerning LeShan's failure to better 
equate the experimental and control 
groups on age, for there is reason to 
suspect a relation between nail biting 
and age. Wechsler (25) found a 
sharp rise in nail biting for boys 
around the age of twelve; if this is so, 
there would be a lower probability 
for the older control group to stop 
biting its nails than the younger ex- 
perimental group, thus reducing the 
effectiveness of the older group as a 
control. 

Two of the experimenters added 
a second experimental group to their 
design. Fox and Robbins (3) and 
Hoyt (8) used both a facilitation and 
an inhibition group in order to obtain 
a more sensitive indication of the 
value of sleep-training. Of all of the 
studies, only that of Fox and Robbins 
(3) provided the control group with 
a neutral stimulus—music—for the 
same amount of time as that in which 
the experimental group received the 
verbal test material. If there are any 
disturbances during sleep due to the 
stimulus and if these in turn affect 
recall, such a procedure is a wise one. 

Although none of the Es were di- 
rectly concerned with the problem, 
the use of an additional control group 
to compare results from sleep-train- 
ing with the equivalent amount of 
waking-training would have been 
quite illuminating. 


STATISTICAL CONSIDERATIONS 


Only five of the #s (1, 2, 3, 8, 23) 
treated their data statistically to see 
if the sleep-training improved per- 
formance significantly. The remain- 
ing /s used clinical criteria to evalu- 
ate the effects of sleep-training. 

Although Elliott’s (2) results fa- 
vored the performance of the sleep- 
trained group over the nonsleep- 
trained group, he failed to find a sig- 
nificant difference at the 5 per cent 
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level in the number of trials it took to 
relearn the training list. Elliott had 
equated his groups on a pretest, but 
did not attempt to match the indi- 
vidual Ss for the analysis. Since 
equating tends to decrease differ- 
ences between means, failure to re- 
move the variance due to Ss inflates 
the error variance and decreases the 
probability of getting significant dif- 
ferences. When the present authors 
(22) did an analysis of covariance 
with pre- and posttest scores from 
Elliott's data, the differences between 
the mean number of trials to learn a 
new list by sleep-trained and non- 
sleep-trained Ss were significant be- 
low the .05 probability level. 

Coyne (1) used the one-tailed t test 
to evaluate his data. The wisdom of 
this treatment is questionable. In 
order to avoid abuses and contro- 
versy, exploratory work should be 
as cautious in its interpretations of 
results as it should be daring in at- 
tempting new ideas. Using the one- 
tailed ¢ test does not allow for the 
possibility that differences might be 
in the direction opposite to that hy- 
pothesized (6). This is serious in any 
exploratory work; it is particularly 
dangerous in sleep-learning studies 
where one could seriously suspect 
that the intervening training during 
sleep might actually hinder normal 
waking recall. 

Half of the Es (2, 3, 8, 12; footnote 
2) used a reasonably large number of 
Ss as compared to the number used 
in most psychological studies; in the 
remaining cases, the number was 
smaller. When the number of Ss is 
small, one might be more critical of 
accepting the null hypothesis merely 
because the level of significance was 
not below the traditional 5 per cent 
level. For studies as exploratory as 
these, significance levels of 15 per 
cent could be arbitrarily considered 
encouraging. Since the expense in 
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time and money is relatively small, 
preliminary work in sleep-training 
should favor fewer Type II errors in 
order not to reject valuable experi- 
mental leads by accepting a false null 
hypothesis. 


METHODOLOGICAL CONSIDERATIONS 


A number of Es believed that dif- 
ferences in methodology might have 
been responsible for the divergent 
successes and failures found in the 
sleep-learning studies. These and 
other considerations are discussed be- 
low. 


Subjects 


The majority of the studies em- 
ployed the traditional college student 
—male and female—as Ss. Thur- 
stone used Navy men and LeShan 
used young boy campers. Hedges (5) 
bravely attempted to provide sleep- 
training as a supplement to the wak- 
ing-training of children who had 
speech deficiencies and who were sus- 
pected of being mentally retarded. 
None of the Zs attempted to study 
the effects of either age or sex on 
sleep-learning. 

The selection of Ss may have an ef- 
fect on whether successful sleep-train- 
ing results are attained or not. Un- 
derwood, while reviewing Fox and 
Robbins’ paper in the 1953 Annual 
Review of Psychology, commented 
that “such low variability [on per- 
formance scores] among Ss on the 
test list is rarely found in normal 
transfer experiments with such ma- 
terial, but this may again only re- 
flect the presence of a highly select 
and homogeneous sample’ (24, p. 
48). Low within-group variability 
was certainly in part responsible for 
the high degree of significance of the 
differences between the sleep-trained 
groups and the control. 

Whatever the variability of the 
group, it would appear wiser to 
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choose individuals who have shown 
the capacity to learn while awake. 
Perhaps the effects of sleep-training 
are so subtle that its benefits will be 
found only when it is applied to indi- 
viduals with very high IQ's. 


Number of Repetitions 


The experiments can be divided in- 
to two groups on the basis of this vari- 
able—those who gave an exception- 
ally large number of repetitions of the 
material during sleep (5,12) and those 
that gave significantly fewer repeti- 
tions (1, 2, 3, 8, 13, 23; footnote 3). 
The large number of repetitions were 
actually spread over a number of 
nights and ranged from a total of 433 
times over a period of eight nights to 
16,200 times over a period of 54 
nights. The smaller number of repe- 
titions ranged from 8 to 180 times, 
with one study (13) playing the ma- 
terial five times per night for three 
successive nights. More repetitions 
were characteristic of the field as 
opposed to the laboratory studies. 
Both groups obtained both positive 
and negative results, although the 
tasks were varied sufficiently so that 
direct comparisons could not be 
made. LeShan’s (12) study, in which 
40 per cent of the experimental group 
stopped biting its nails, represented a 
successful example where a great deal 
of repetitious training seemed to have 
affected a semi-involuntary behavior. 

Stampfl (23) and Coyne (1) found 
no differences when they tried to 
study the effects of different amounts 
of repetitions on sleep-learning. Ac- 
tually since neither E used a non- 
sleep-trained group as a_ control, 
neither could conclude that any sleep- 
learning took place at all in this phase 
of their study. Coyne (1) suggested 
that in his study there may have been 
no greater differences in savings after 
one hour of repetitions than after four 
hours because the material was simple 
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enough to be learned in one hour and 
additional practice could not improve 
it. This is a reasonable hypothesis, 
as is its antithesis—that during sleep, 
learning is sufficiently slow so that 
little is learned even after four hours 
of repetitions. Sleep-learning, if it is 
to occur at all, may require that an 
extremely large number of stimulus 
repetitions take place. It will be 
important to evaluate sleep- versus 
waking-training from the standpoint 
of economy of both time and effort. 


Presentation 


The manner in which the study 
material is presented to the sleeping 
S has been considered by some Es as 
critical to the success or failure of the 
training. Two dimensions of this var- 
iable are the time of presentation and 
the order of presentation. 

The problem of presentation time is 
an important one since during the 
sleeping period, time is related to 
some extent to the depth of sleep, 
which in turn may be related to train- 
ability. Deeper sleep tends to be 
more prevalent in the early period of 
sleep, while lighter sleep tends to 
occur later (18). Of course, the levels 
vary considerably throughout a nor- 
mal night’s sleep. 

Coyne (1), studying the effects of 
presenting the material at different 
times during the sleep period, failed 
to equate his lists beforehand and 
could draw no conclusions from his 
results. He recognized that presenta- 
tion time might be inversely related 
to the amount retained. He sug- 
gested, however, that this was due to 
the recency of the presentation to the 
recall period. It is interesting to spec- 
ulate whether or not the Jenkins- 
Dallenbach interference effect (10) 
occurs within the sleep period for 
materials presented during sleep. 

The differences Coyne found be- 
tween massed and distributed learn- 
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ing might also be accounted for on the 
basis of presentation time. Although 
he concluded that distributed sleep- 
learning was superior to massed sleep- 
learning, there is no way to determine 
whether this was so because of the 
spacing of the training or because 
some of the distributed inputs oc- 
curred during the period just before 
waking—often a light drowsy state— 
while the massed inputs occurred 
only during the deeper and possibly 
less receptive period. 

The order in which the material 
was presented during sleep may also 
affect the results of sleep-training. 
In a waking state, serial learning has 
been shown to be easier than learning 
material presented in a varying order 
(7). Thus, if any learning takes place 
in sleep, a serial presentation would 
more probably increase any positive 
effects which might occur. Also, if 
the sleep state is one in which no 
mental organization takes place, this 
would favor the learning of only the 
more organized serial presentation. 

Hoyt (8) and Stampfl (23) varied 
the order in which their material was 
presented and failed to find that any 
sleep-learning took place. Fox (3), 
LeShan (12), Elliott (2), and Leuba 
(13) believed they found evidence of 
successful sleep-learning. Thurstone 
probably varied the order of his ma- 
terial, but his Ss were practiced over 
a period of months so that the effects 
of presenting the material in a vary- 
ing or an unvarying order may have 
been minimized. 

The varying presentation order in 
Hoyt’s study was methodologically 
different in one major respect from 
that used in Stampfi's study. Since 
the former study used paired-associ- 
ate material, varying the order in 
which the pairs were presented would 
have less effect than it had in the lat- 
ter study where the order of words in 
a list was varied during sleep-training, 
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even though they had to be recalled 
serially during the waking period. 
Any positive effects of sleep-training 
in the latter case may have been 
neutralized by the negative effects 
built up through nonserial learning. 


Training Problems, Materials, and 


Mode of Input 


The types of psychological prob- 
lems studied by the majority of sleep- 
training investigators have been quite 
limitesl. With the exception of 
Coyne’s (1) work on concept forma- 
tion and perceptual sets and LeShan's 
(12) and Hedges’ (5) therapeutic stud- 
ies, the remainder of the research 
has been involved with training prob- 
lems which require the memorization 
of word lists. It is unlikely that all 
types of problems are suitable for 
sleep-training, although exploratory 
studies should examine many rather 
than a few possibilities. 

Of the material used in the studies 
requiring verbatim recall, the degree 
of meaningfulness ranged from lists 
of nonsense syllables through short 
words to foreign language vocabu- 
lary. We know that the more mean- 
ingful the material, the easier it is to 
learn in the waking state (7). Stampfl 
(23) suggested that this might ex- 
plain why his results with nonsense 
syllables were poorer than Elliott's 
(2) who used a list of adjectives, and 
why Fox (3), using a Chinese-Eng- 
lish vocabulary, got even more strik- 
ing results. This does not seem to be 
a critical variable, however, for Hoyt 
(8), using the same Chinese-English 
vocabulary as Fox (3), got negative 
results, while LeShan (footnote 3) and 
Coyne (1), using nonsense syllables, 
got positive results. 

All of the Es used an auditory in- 
put. This is certainly the most obvi- 
ous technique and would appear to 
be the most economical; however, 
other sensory channels need not be 
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ignored. It may be necessary to flash 
lights on closed eyelids or to appl) 
tactual stimuli to the fingertips in 
code in order to “reach’’ the sleeping 


S. 


Techniques and Measures of Retention 


The techniques used to measure 
the retention of training material can 
have considerable influence on the 
amount of material recalled. How- 
ever, in the present studies, there did 
not appear to be a pattern of suc- 
cesses or failures consistent with the 
technique used. 

A number of Es did not require the 
verbatim recall of the verbal materia! 
presented during sleep. Instead, they 
evaluated retention on S’s ability to 
use the material in posttraining tasks 
or on an observed change in S's be- 
havior after the training. LeShan 
(12) examined his Ss’ fingernails ev- 
ery two weeks to see if the children 
had stopped biting their nails. This 
technique might have been slightly 
more objective had the examiner not 
known which children were and were 
not receiving the _ sleep-training. 
Hedges’ (5) clinical evaluations of his 
children’s speech improvement re- 
quired even more subjective judg- 
ments. Because of the very com- 
plexity of this measure, Hedges won- 
dered whether it was sensitive enough 
to detect improvement. Coyne (1), 
in several studies, had his Ss de- 
scribe an out-of-focus picture, the 
identity of which had been given to 
them during sleep, and to determine 
E’s solution of concepts composed 
of the correct combination of certain 
stimulus variables, the answer to 
which had also been provided during 
sleep. Thurstone’s Ss were evaluated 
on their ability to send and receive 
Morse code. Both Coyne’s and Thur- 
stone’s results could be quantified. 

Leuba (13) gave his Ss the titles of 
songs played during sleep and re- 
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quired them to recall unaided the 
lyrics. Positive results were claimed 
for sleep-training in this study. 

The savings method has been used 
by a majority of Es (1, 3, 8, 23; foot- 
note 3), for they believed it to be a 
more sensitive measure of retention. 
Stampfl (23) believed that although 
material could not be consciously re- 
called at any moment, its presenta- 
tion during sleep may have still modi- 
fied the nervous system sufficiently 
to make learning easier and a savings 
effected. However, use of the savings 
technique may confuse the measure 
of retention: with a measure of the 
ability to learn since both are con- 
founded within the same perform- 
ance score. None of the Es using the 
savings measure compared the time 
to relearn a list with the time it took 
the same S to learn an equivalent list 
presented after sleep. Comparisons 
with the original list presented before 
sleep are not sufficient. The positive 
transfer which occurs from the pre- 
test to the posttest-—the phenomenon 
of “learning how to learn’’—may ac- 
count for some of the apparent sav- 
ings which several researchers be- 
lieved they found. However, Stampfl 
(23) gave three practice lists to be 
learned before the experiment in or- 
der to bring Ss’ learning curve closer 
to its asymptote and thus reduce the 
‘learning to learn’”’ effect. 

A still more sensitive technique for 
measuring retention is that of recogni- 
tion; at least in Luh’s (16) classical 
study this was so after a two-day in- 
terval between training and the post- 
training test. Coyne (1) used this 
technique in the form of a multiple- 
choice test for one of his exploratory 
studies on perceptual set, but still 
failed to get positive results. Hoyt 
(8) told of an exploratory study in 
which he presented a single number- 
word pair to a sleeping S who was 
under constant observation for a total 





338 CHARLES W 
of 11 hours on three successive nights. 
On awakening after the third night, 
S was given the number and asked to 
pick the correct word from a group of 
ten. He was unable to do this cor- 
rectly. A major difficulty with the 
recognition method is that it must be 
corrected for chance guessing. With 
words, however, this correction for 
chance is difficult since all words do 
not have an equal probability of be- 
ing recognized (17). 

Variations in the score used to 
measure retention have failed to con- 
sistently differentiate success and 
failure in sleep-learning. Many of the 
Es (1, 2, 8, 23; footnotes 2, 3) used the 
number of trials to learn posttraining 
material to one perfect repetition as 
a measure of retention. Hoyt (8) re- 
quired that the lists be learned to two 
perfect anticipations; this may tend 
to make his results slightly more re- 
liable. Some Fs (1, 2, 8, 23) also used 
as measures of retention the number 
of errors or correct responses made on 
the first trial or later trials, or on 
cumulative trials. Where the Ss per- 
formance was used to measure re- 
tention, the measure was charac- 
teristic of the task, e.g., Coyne used 
the extent of focus of the projector 
as the measure of performance in his 
perceptual set study. The measure 
that is used may determine in part 
the results which are found. For ex- 
ample, Coyne (1) noted that the 
measure by trials often favored the 
opposite results than those favored 
by the measure by errors. Error 
measures generally lead to less con- 
clusive results than those measured 
by trials. This situation does occur 
in waking-research and need not be 
contradictory, although it certainly 
affects the conclusions drawn as well 
as the practical applications of the re- 
sults. 

Before final conclusions can be 
drawn concerning the feasibility of 
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sleep-learning, more recall techniques 
should be tried. Simon and Emmons 


(22) discuss this while considering 
the possibility of secondary cerebral 
storage mechanisms for material in- 
troduced during sleep.® 


SLEEP CRITERIA 


Perhaps the most damaging criti- 
cisms of the sleep-learning studies to 
date have been the inadequate con- 
trol of sleep and the criteria used for 
defining sleep. So elusive is the proc- 
cess of sleep from the psychological 
standpoint and so little is known 
about the actual physiological mech- 
anisms which cause sleep that the 
problem of determining whether S is 
asleep is to a great extent a semantic 
one. Although both direct observa- 
tion and S’s subjective report are 
reasonably reliable for deciding if S 
has been asleep over a block of time, 
neither can be considered sufficient 
to know if Sis asleep at any moment 
in time. ‘Therefore, the care with 
which £ determines the sleep condi- 
tion of his S at the time of the input 
is highly important and determines 
the degree of confidence which one 
may place in the conclusion that 
learning during real sleep is or is not 
possible. Some Es (1, 2, 3) eliminated 


*’ Dr. Bernard Fox, in a private communi- 
cation (26 June 1954), described his attempt 
to use hypnosis as a means of facilitating the 
recall of sleep-trained material. A 55-year- 
old man was brought to a point where deep 
hypnosis was possible. In his normal sleep 
he was presented with a list of ten words re- 
peated for a half-hour. During this period, 
S was observed and the input turned off 
whenever he moved. The next morning, he 
was unable to recall any words unaided. 
When he was hypnotized, he made only one 
association which might have been related to 
one of the words. Still under hypnosis, he 
recognized fewer words than would be expected 
by chance when the ten were interspersed 
randomly in a group of thirty. Neither read- 
ing the words aloud nor making him choose 
by a forced-choice method produced any more 
positive results. 
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S from the experiment if he awoke or 
said he heard the stimulus. Other Es 
(12, 23) shut off the stimulus until 5 
went back to sleep. 

In four of the studies (2, 3, 8, 13), 
Es asked Ss after the training if they 
had awakened. This is an unsatisfac- 
tory criterion of sleep for experimental 
studies since it is a common expe- 
rience to awaken during the night, 
perform a number of rational acts 
before retiring again and remember 
nothing the next day (8, 18). Hoyt 
stated that one S was observed to 
awaken during the night, leave the 
laboratory to go to the bathroom, 
and later return to bed. The next 
morning, S remembered nothing 
about it. In two of these studies (3; 
13) Es were not even present to ob- 
serve Ss as an added check.*® 

In one experiment (3) the time of 
night was considered a partial criter- 
ion of sleep; training was applied dur- 
ing the period when studies have 
shown sleep to be at its deepest. 
However, published sleep curves are 
based on statistical averages and 
cannot be used to predict accurately 
the sleep characteristics of the single 
individual. Individual sleep curves 
vary considerably from time to time 
during the night (15). 

Two of the Es (12, 23), before pre- 
senting the stimulus material, asked 
Ss if they were asleep. Since Le- 
Shan’s (12) Ss were adolescent boys 
in camp, naive as to the purpose of 
such questioning, one may wonder 
if asking them if they were asleep 
would be a sufficient check. It has 


6 Fox stated “if any Ss in the group of 30 
actually did wake up without reporting it, 
then the number was probably distributed 
about equally throughout the groups. More- 
over, since all Ss with one exception show ef- 
fects in the expected directions, it is clear that 
the results cannot be explained on the bases of 
a few Ss in each group who awakened but 
failed to report doing so”’ (3, p. 78). It may 
be on the basis of many. 
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been observed that even willing Ss of- 
ten fail to respond to such questions, 
although they admit hearing them 
later. Another major difficulty with 
this technique is that S may actually 
have been asleep when the test ques- 
tion was asked, but later would 
awaken sufficiently to hear the sub- 
sequent input. In these studies, no 
systematic check was made to be 
sure that Ss did not awaken after the 
initial test, except to note if they 
moved. 

Coyne (1) and Hoyt (8) had their 
Ss press a button whenever they awoke 
in order to stop the input of training 
material. This method alone is not 
sufficient since it has been found that 
often the desire as well as the ability 
to do this is not present although Ss 
are able consciously to hear and un- 
derstand the material. Since much of 
the material used in these studies 
was short and specific, even a few 
seconds of waking presentation would 
have been sufficient to allow Ss to 
hear and learn much of the stimulus 
while awake. 

Stampfl (23), LeShan (12), and 
Hoyt (8), in addition to asking their 
Ss if they were or had been awake, 
also noted whenever they moved, and 
turned off the machine at that time. 
This criterion of sleep is not com- 
pletely satisfactory since EEG rec- 
ords indicate that sleep may lighten 
to almost a wakening state before 
movement occurs, as well as after- 
wards, or when there has been no 
movement at all (15). Hoyt (8) 
found that two of his Ss said they 
had awakened but showed no ob- 
servable movements, while three sub- 
jects said they had not awakened but 
showed from two to six movements 
each. This suggests that the criteria 
of asking and noting movements did 
not correlate very well. In Hedges’ 
(5) study, the parents occasionally 
observed the children to see whether 
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or not they awoke during the sleep- 
training; two children awoke. In the 
first case the number of training repe- 
titions were not counted for the time 
S was awake. However, without a 
waking control, this correction is 
meaningless. The other S awoke 
while the recording was playing and 
began drowsily to follow the instruc- 
tions given on the record. Hedges 
felt this aided the progress observed 
in that case. 

Elliott (2) used the electroencephalo- 
gram to determine the S's sleep level. 
Independent workers using a variety 
of techniques have established a sig- 
nificant correlation between the brain 
wave patterns and the depth of sleep 
(14). Because the writers believe 
that properly evaluated electroen- 
cephalograms represent the most ob- 
jective, continuous, and practical on- 
the-spot indicator of sleep depth 
available, Elliott's (2) positive re- 
sults—often quoted by the commer- 
cial firms in support of their claims 


for sleep-learning—are quite note- 
worthy. An examination of his sleep 
criterion revealed one major flaw, 
i.e., Elliott did not keep the EEG 
running during the entire training 


period. Therefore, no continuous 
check was made of the S’s sleep level 
while the stimulus was being played. 
The EEG was turned on at the 
beginning of the training and kept on 
until E believed that sleep would re- 
main deep enough; then he would 
shut it off for the remainder of 
the half-hour stimulus period. With 
Elliott’s data, a correlation of —.39 
was found between the amount 
learned and the amount of monitor- 
ing. This was not significant at the 
5 per cent level but in a direction 
which suggested that more savings 
occurred with less monitoring. The 
writers (22) also compared the 
amount of savings made by a group 


98 Fie ay ee 


CHARLES W. SIMON AND WILLIAM H. EMMONS 


of Elliott’s five Ss who were moni- 
tored nearly all of the training time 
with the savings of the remaining 
fifteen Ss who were monitored on an 
average slightly more than a fourth 
of the time, and never more than 54 
per cent of the time. For the first 
group, the average savings in trials 
to learn was 27.5 per cent; for the sec- 
ond group, the average savings was 
45.0 per cent. The difference of 17.5 
per cent was highly significant below 
the 2 per cent level of significance and 
we can conclude that when a thor- 
ough check of the sleep condition was 
made with the EEG during training, 
considerably less savings took place. 

That any savings occurred for the 
groups monitored nearly 100 per cent 
of the time might be accounted for in 
two ways without assuming that 
learning actually occurred during 
sleep. First, the apparent savings 
may be simply an effect of the “‘learn- 
ing how to learn’’ phenomenon dis- 
cussed earlier in the section on 
“Methodological Considerations.” 
Second, Elliott stated that whenever 
an alpha pattern was “‘clearly’’ pres- 
ent he concluded S was awake and 
turned off the input machine. In so 
doing, he quite likely permitted many 
of his Ss to listen while they were al- 
ready awake since movement, tense- 
ness, or the opening of the eyes may 
sometimes block a clearly established 
alpha. 

The expense of the EEG equip- 
ment negates its widespread use. 
This does not mean that without it, 
however, adequate  sleep-learning 
studies cannot be done. The use of a 
combination of the criteria and con- 
tinuous monitoring may be sufficient 
to insure some rough but adequate 
control of S’s sleeping condition. It 
is interesting to note that Hoyt (8) 
and Stampfi (23), finding negative 
results, were the two Es using the 
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greatest number of multiple and con- 
tinuous criteria. 

The continuity of monitoring can- 
not be overemphasized; for in spite 
of wishful thinking to the contrary 
Ss do awaken while the input is being 
presented.’ Two out of three of 
Hedges’ (5) Ss awoke. As many as 
half of Coyne’s (1) Ss awoke in some 
of his studies. Elliott (2) reported 
that six of his Ss awoke. Only two 
out of sixteen of Hoyt’'s (8) Ss in the 
experimental group failed to awaken 
during an input period. LeShan (12) 
also stated his Ss awoke, but did noi 
specify how many. One-fourth of 
Fox’s (3) Ss awoke, and were elimi- 
nated from the study. 

The conditions under which Ss 
sleep can influence to some extent the 
soundness of their sleep. Elliott (2) 
and Hoyt (8) required Ss to sleep in 
the laboratory one night previous to 


the experiment. Any sleeplessness 


occurring on this first night probably 
induced a deeper sleep on the second. 


Hedges (5), Fox (3), and Leuba (13) 
used Ss while they slept in their own 
homes, while Thurstone (footnote 2), 
LeShan (12), and Stampfi (23) used 
Ss who had been sleeping in familiar 
“homes away from home” for some 
time. 

Although most of this discussion 
has centered around the problem, 
Was the sleep state deep enough?, 
Stampfl (23) and Leuba (13) sug- 
gested that there is an intermediate 
point between waking and deep sleep 
that is optimal for sleep-learning. 
Some negative results, they believed, 
may have been obtained because S$ 
was too deeply asleep. Whether this 
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is true or not is an experimental ques- 
tion for future Es to answer. 


SUMMARY AND CONCLUSIONS 


Ten sleep-learning studies were re- 
viewed. Many of these have been 
cited uncritically by commercial firms 
or in popular magazine and news ar- 
ticles as evidence in support of the 
feasibility of learning during sleep. 
A critical analysis was made of their 
experimental design, statistics, meth- 
odology, and criteria of sleep. All of 
the studies had weaknesses in one or 
more of these areas. 

It is highly speculative whether or 
not the studies reviewed in this paper 
have presented any acceptable evi- 
dence that learning during sleep is 
possible. The inadequate control of 
a number of experimental variables 
makes the validity of the conclusions 
drawn by many of the Es unwar- 
ranted. The conditions under which 
the results were found tend more to 
support the contention that some 
learning takes place in a special kind 
of waking state wherein Ss appar- 
ently do not remember later on if they 
had been awake. This may be of 
great practical importance from the 
standpoint of economy in study time, 
but it cannot be construed as s/leep- 
learning. More carefully controlled 
experiments in the future may pro- 
vide us with a clearer answer to the 
question,” “Can one learn during 
sleep?,”’ as well as to provide com- 
parative data between waking and 
resting learning from the standpoint 
of economy of time and effort. The 
problem is partially confounded by 
an inadequate definition of sleep. 
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TEACHING IN THE IVORY TOWER, WITH RARELY 
A STEP OUTSIDE 
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In recent years, psychologists have 
literally gone all over the world, and 
addressed themselves to a great va- 
riety of problems. In so doing, they 
have tried a variety of training and 
instructional methods, often on the 
job or in the field, and have evaluated 
those methods realistically, as with 
reference to outcomes in production 
records or combat data. One might 
hope that all this would have in- 
vigorating influences upon methods 
in the teaching of psychology, and ap- 
praisals of them. 

Instead, an extensive and careful 
recent review! of investigations of the 
teaching of the subject since 1942 
shows the largest single group dealing 
with phases of the old comparison of 
lecture versus discussion in the ele- 
mentary course, as appraised most 
commonly by final examinations. 
And the reviewers seem to excuse the 
minimal imagination as to methods 
and minimal results by implying a 
minimal ‘“‘degree and nature of 
change that one can produce in a one- 
or-two-semester course in psycholo- 
gy” (p. 52). But why are not these 
limitations of time a challenge to use 
varied and vigorous methods, chosen 
on the basis of incisive evaluation? 
Yet the review seems not to mention 
any use of laboratory methods, nor 
any projects, field trips, or field ex- 
perience. And the great majority of 
the evaluations seem to have been in 
terms of paper-and-pencil tests, usu- 
ally given in the evaluated course 
(not later or elsewhere). Only talk- 


1 Birney, R., & McKeacuiz, W. The 
teaching of psychology: a survey of research 
since 1942. Psychol. Bull., 1955, $2, 51-68. 


talk in a college classroom as a means 
of bringing about learning, and only 
markings there on a piece of paper 
to evidence whether significant learn- 
ing has occurred! 

But many other methods are pos- 
sible, even in a big elementary course 

as the writer knows from over 25 
years experience supervising such a 
course, enrolling some thousand stu- 
dents a year (mostly freshmen), 
taught mostly by graduate students. 
There can be informal and social 
laboratory tasks, ranging from gen- 
uine research projects which the 
students understand as such and are 
later reported in print, or the making 
of simple equipment such as charts 
for the course, to informal half- 
humorous data gathering regarding 
the “social psychology” of freshmen. 
There can be field trips, as to an in- 
stitution, or field work as helping in 
the survey of a public school. And 
evaluations can include careful check- 
ing that the laboratory research is 
well done and the charts well made, 
notations that the school principal 
spoke favorably of the survey as- 
sistants and that after an institution 
visit, a girl asked intelligent ques- 
tions about career opportunities in 
work there, or an anecdotal record 
that on a field trip an initial isolate 
had become an accepted member of 
a group. 

The Wolfle committee doubted the 
legitimacy of many such doings as 
either too elementarily useful or too 
special: one should ‘‘teach psychol- 
ogy’’; and only certified clinicians or 
counselors could properly deal with 
many of these problems. But be- 
ginning courses serving mostly stu- 
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dents who will not go further in the 
subject surely should be realistic 
about them and their needs—can 
thus best serve them and give them 
some functioning understanding of 
the subject. And bringing it about 
that an isolated student gets ac- 
quainted need require little more of a 
good instructor than being a good 
host; even aid in early consideration 
of a vocational choice sometimes re- 
quires little more than mature good 
sense, with referral. 

The reviewers report the extraor- 
dinary finding that sophomores get 
better marks in a first course in psy- 
chology than do freshmen, and seem 
to agree with the inference that the 
first course should therefore not be 
given in the first year. But if the 
first course were not taken till gradu- 
ate school, grades would be still bet- 
ter! Surely not where students do 
best is the place for a course, but 
where they need it the most. 

The rather unhappy reviewers con- 
clude with the statement that the 
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most serious need, in research in 
teaching, is to get it ‘related in some 
way to general psychological theory”’ 
(p. 64).2. Climb higher in the ivory 
tower! Why not instead come down 
and open the door, watch what stu- 
dents do outside class, see what psy- 
chologists are doing in the wide world, 
and even venture off campus occa- 
sionally, to mix with the folks on 
Main Street. 


Received March 24, 1955 


? There could well be more consideration of a 
broad theory of higher education. More than 
any other subject field, psychology should 
have appreciation of the great variety of gains 

~in abilities, understandings, personality 
traits—possible during the college years. 
Psychologists have a great variety of con- 
tributions to make, in furtherance of those 
gains. Some are highly special, as in counsel- 
ing or therapy. But the basic contention of 


this commentary is that there remains a great 
range of possible contributions, most best at- 
tempted by broadened concepts of and meth- 
ods in teaching—and that the excellent stud- 
ies summarized in the review cover that range 
The plea 


only from A perhaps through C, 
is for a wider range of experiment. 
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Kyoto University 


In the former Psychological Ne- 
crology (1903-1927) by E. G. Boring 
(2) two Japanese psychologists were 
listed, namely, Y. Motora and T. 
Nakashima, but in the new one by 
Bennett and Boring (1) none were 
included. Perhaps this is because 
communication between American 
and Japanese psychology was inter- 
rupted during World War II. How- 
ever, Japanese psychology has de- 
veloped a great deal during this pe- 
riod. Among the deceased psycholo- 
gists in Japan there are a number who 
are worthy of being included in Bor- 
ing’s new list. From a total of 45 de- 
ceased psychologists, I selected 25 
names and asked the leading psychol- 
ogists in Japan, Tatsuro Yatabe 
(Kyoto Univer.), Toru Watanabe 
(Nihon Univer.), Matsusaburo Yoko- 
yama (Keio Univer.), Sadaji Takagi 
(Tokyo Woman's Christian College), 
Megumi Imada (Kanseigakuin Uni- 
ver.), Yoshikazu Ohwaki (Tohoku 
Univer.), and Torao Obonai (Tokyo 
Univer. of Education), to check them 
against the level of Boring’s list. The 
following 13 were selected: 


Fukutomi, Ichiro, Kenkoku Univer., 
Manchuria, b. 8 June 1891, d. 27 
Feb. 1946. 


. ° “9 - ee ° 
Hayami, Hiroshi, , Keijo Univer., 


Korea, b. 23 Oct. 1876, d. 27 June 
1943. 

Iwai, Katsujiro, Kyoto Univer., b. 
17 Feb. 1886, d. 2 Nov. 1937. 
Kakise, Hikozo, Ministry of Educa- 
tion, Tokyo, b. 13 Sept. 1874, d. 

23 April, 1944. 

Kuroda, Ryo, Keijo Univer. and Gifu 
City, b. 30 Jan. 1890, d. 5 Jan. 
1947. 

Kubo, Yoshihide, Hiroshima Bun- 
rika Univer., b. 22 Apr. 1883, d. 
12 Nov. 1942. 

Masuda, Koreshige, Tokyo Univer., 
b. 29 Dec. 1883, d. 7 Aug. 1933. 

Matsumoto, Matataro, National 
Academy of Science, Tokyo, b. 15 
Sept. 1865, d. 24 Dec. 1943. 

Morita, Shoma, Jikeikai Medical 
School, Tokyo, b. 18 Jan. 1874, d. 
12 Apr. 1938. 

Nakamura, Katsumi, Tokyo Univer. 
of Education, b. 7 Feb. 1901, d. 6 
Apr. 1952. 

Onoshima, Usao, Ministry of Educa- 
tion, Tokyo, b. 16 Mar. 1894, d. 
24 Jan. 1941. 

Rikimaru, Jien, Taihoku Univer., 
Formosa, b. 3 Jan. 1890, d. 4 Dec. 
1945, 

Takashima, Heizaburo, Toyo 
Univer., Tokyo, b. 1 Oct. 1865, d. 
15 Feb. 1946. ‘ 
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Narziss Ach was born on 29 October 
1871 (not in 1848), 
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HANDBOOK OF SOCIAL PSYCHOLOGY! 
DANIEL KATZ 


University of Michigan 


A handbook in any field of knowl- 
edge is an ambitious undertaking. 
When that field is growing so rapidly 
that changes of major significance 
take place between the writing of a 
manuscript and its publication and 
when the field ramifies widely with 
ill-defined boundaries, the task is 
even more formidable. In this per- 
spective the two volumes edited by 
Lindzey are a magnificent achieve- 
ment. They replace all existing texts 
and sourcebooks or any combination 
of text and sourcebook. They present 
clear and incisive accounts of the 
major theoretical positions in social 
psychology and they give good cover- 
age to the thinking and research on 
all the traditional topics in the field. 
Many of these topics are old chapter 
headings in social psychology (humor 
and laughter, facial expression, mass 
phenomena) which more recent texts 
have omitted. And there is an ex- 
cellent section of nine chapters on 
research methods. Moreover, many 
of the chapters are more than reviews 
of the literature in their insightful 
integration or in their suggestions for 
future research. ‘Though social psy- 
chology has been characterized by 
sectarian trends and an impatience 
with an historical understanding of 
its problems, the Handbook of Social 
Psychology should help to counteract 
these forces. Some of the chapters are 
still too selective but the over-all pat- 
tern is one of real progress. High 


1G. Lindzey (Ed.) Handbook of social 
Psychology. | Cambridge: Addison-Wesley, 
1954. Vol. I (pp. 588+x), Vol. II (pp. 601 
1226+x), $8.50 one volume. $15.00 per set. 
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editorial standards have been main- 
tained and the writing is not only 
clear, but for a handbook, unusually 
interesting. Almost any adequate 
organization of chapters in this field 
would have problems in _ finding 
mutually exclusive areas but the 
amount of overlapping in the liter- 
ature reported by the various authors 
is not great. 

The opening chapter by Gordon 
Allport presenting the historical 
background of modern social psychol- 
ogy is one of the finest in the two vol- 
umes. It is a convincing argument 
for the importance of the historical 
approach. It shows the similarity 
and, sometimes, the virtual identity 
of contemporary concepts and prob- 
lems with the ancient history of our 
discipline and demonstrates how 
easily some of the blind alleys in our 
development might have _ been 
avoided with more consideration of 
the thinking and work already avail- 
able. If its wise precepts were to be 
followed we might temporarily solve 
our overcrowded journal situation 
by seeking for significant research 
problems rather than following the 
latest fad in labels and techniques. 
This chapter is rightly titled ‘His- 
torical Background” in its concentra- 
tion on the similarities between cur- 
rent ideas and their early counter- 
parts. What would also be desirable 
would be a companion chapter on his- 
torical development which with the 
same bold strokes would show how we 
have moved from the early stages to 
our present situation. This, how- 
ever, is not the conventional task of 
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a handbook. The reviewer would 
make one minor criticism reflecting 
his own bias. There is too little ma- 
terial on the objective approach rep- 
resented by such writers as F. Oppen- 
heimer, E. Durkheim, K. Marx, and 
E. B. Holt. 

Of the five chapters devoted to 
contemporary systematic positions, 
those on field theory and on Freudian 
theory are superior presentations 
in their clarity, organization, and 
relevance to social psychological 
problems. M. Deutsch does a re- 
markable job of exposition in con- 
densing Lewinian metatheory, the 
formal concepts of field theory, and 
the research contributions of this 
school within the confines of a single 
chapter. His judgment that the ex- 
perimental contributions of the move- 
ment stem from Lewinian metatheory 
rather than from Lewin’s brilliant in- 
sight into empirical problems is prob- 


ably sound. This does not mean, 


however, that the significant research 


activities of the school are the out- 
come of a formal set of theoretical 
concepts. The lasting contribution 
of Lewin thus may be a way of think- 
ing about problems rather than a 
theoretical system. This is suggested, 
too, by the fact that Lewinian stu- 
dents have gone off in many different 
directions with some identification 
with a point of view and less adher- 
ence to the development of a com- 
mon set of formal concepts. An in- 
teresting discussion in the Deutsch 
chapter centers about the problem 
of psychological ecology—namely, 
how field theory can take account of 
objective environmental forces, for 
in field theory these forces must be 
transformed into the psychological 
environment. The author accepts 
the concept of Wright and Barker 
of the behavior setting as ‘‘any part 
of the nonpsychological milieu that 
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is generally perceived by the people 
of a community as appropriate 
for particular kinds of behavior.” 
Such behavior settings are coercive 
because of the shared frames of refer- 
ence or collective intentions concern- 
ing appropriate behavior. This is one 
way of defining social norms and of 
accounting for the consistency of the 
resulting behavior. But it does not 
help on the problem of ecology. It 
still must be established that each 
individual subject has the same ex- 
pectancies as his fellow and that he 
knows that his fellows have similar 
frames. 

The Hall-Lindzey treatment of 
Freudian theory and its influence on 
social psychology is again an excel- 
lent summary and interpretation. If 
anything, it is on the conservative 
side in evaluating Freudian influence. 
Even where social psychologists have 
not accepted Freudian theory as a 
system they have utilized extensively 
Freudian concepts and Freudian 
mechanisms. Though their use of 
projection, displacement, and identi- 
fication have not always accorded 
with Freudian theory, they have come 
to fall back upon Freudian dynamics 
for purposes of explanation. The 
early impact of Freudian theory, as 
in F. H. Allport’s Social Psychology 
and E. D. Martin’s Crowd Behavior, 
is also minimized in this chapter. It 
might be added that the tremendous 
vitality of Freudian theory is due in 
part to the fact that its mechanisms 
have some basis in the actual observa- 
tion of human behavior. 

In reporting on major trends in 
cognitive theory, M. Scheerer skill- 
fully analyzes some of the basic is- 
sues about which stimulus response 
and organizational theorists have 
been in disagreement. The similari- 
ties of the concepts employed by 
cognitive theorists are interestingly 
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developed and are related to many 
problem areas. This chapter is an 
important contribution but is per- 
haps of more value for general psy- 
chology than for social psychology. 
W. W. Lambert had the even more 
difficult task of relating stimulus- 
response theory to social psychology. 
His chapter is better on the presenta- 
tion of stimulus-response theory than 
in dealing with the relationship to 
social psychology. The reasons for 
the decline of behavioristic theory in 
social psychology are not faced. One 
reason for the decline lies in our in- 
ability to define clearly the stimulus 
in social situations. Until we can do 
this, reinforcement theory will be 
able to give us explanations only after 
the fact. Social psychology desper- 
ately needs a new objective approach, 
but Lambert’s chapter is an indica- 
tion that it can not come from the old 
stimulus-response psychology with 
its atomism and its grossly oversim- 
plified models. 


The most difficult assignment in 
the theory section was that given to 


T. R. Sarbin on role theory. While 
this is a key area for social psychology 
in relating psychological processes to 
social structure, there is a dearth of 
good material on the subject. Sarbin 
is rightfully concerned with the rela- 
tion between role and self, but he 
gives little attention to the relation- 
ships existing in a system of roles. It 
is not very fruitful to consider roles 
in isolation, for the virtue of the con- 
cept is that it takes us beyond the in- 
dividual into the social system which 
gives the role its meaning. The con- 
cluding statement of the chapter pos- 
tulates variation in role enactment as 
a function of at least three variables: 
validity of role perception, skill in 
role enactment, and the current or- 
ganization of the self. This expresses 
well the orientation of the author 
and his preoccupation with the pe- 
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ripheral problems of role theory. 
Nothing is said about the demand 
character of the role, the sanctions 
employed in the role system, the 
function which the role has for the 
whole system, the competing nature 
of other roles, the nature of the col- 
lective structure of which the role is 
a part, etc. 

The section on research methods 
is not a complete coverage of meth- 
odological and measurement prob- 
lems, but most of the major areas are 
included and the treatment repre- 
sents a high level of competence. The 
introduction by Allen Edwards on 
“Experiments: Their Planning and 
Execution” gives the proper anchor- 
age in experimental design and pro- 
cedures. F. Mosteller and R. R. 
Bush follow with an account of se- 
lected quantitative techniques which 
focuses on the needs of social psy- 
chologists and which emphasizes non- 
parametric methods. Just as we re- 
quire all students to take courses in 
statistics, so too should we require 
writers of statistical texts for these 
prescribed courses to read the chap- 
ter by Mosteller and Bush to see how 
quantitative techniques can be ef- 
fectively communicated. The chapter 
on attitude measurement by B. F. 
Green is a valuable addition to this 
section and summarizes the assump- 
tions, procedures, and uses of the 
methods and models in this field. As 
in the preceding chapter the author 
is down to earth in his common-sense 
judgments about the developments 
in attitude measurement. The two 
chapters on data collection ‘‘System- 
atic Observational Techniques” by 
R. W. Heyns and R. Lippitt and the 
“Interview: A Tool of Social Science” 
by E. Maccoby and N. Maccoby) 
are nice combinations of theoretcial 
analyses and practical considerations 
of the researcher’s problems. The 
chapter by the Maccobys is note- 
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worthy in cutting through the ver- 
bose controversies about techniques 
and in dealing with the essential di- 
mensions of the interviewing prob- 
lem. It is valuable too for its report- 
ing of methodological research. The 
chapter on observational techniques 
will be very useful because of its in- 
clusion of many different types of ob- 
servational scales. As the authors 
suggest, we need to have more in- 
vestigators utilizing the same ob- 
servational system if we are to build 
knowledge in our field. The growing 
popularity of the sociometric tech- 
nique and its applicability to social 
interaction and social structure are 
recognized in a special chapter on 
‘“‘Sociometric Measurement” by G. 
Lindzey and E. F. Borgatta. The 
centrality of the technique to social 
psychology is emphasized and de- 
velopments both on the data collec- 
tion side and on data analysis are de- 
scribed. The authors felt the neces- 
sity of choosing between a concept 


of sociometric technique which fol- 
lowed closely Moreno’s early work 
and a broad concept of the term 
equivalent to the measurement of so- 


cial behavior. In choosing the 
former, they neglected the alternative 
of accepting the basic technique of 
reciprocal preferential naming or 
ranking in the Moreno tradition, but 
of applying it to a much wider range 
of problems than personal attractive- 
ness, leadership, and informal pat- 
terns in small groups. It is of interest 
that as this chapter was being written 
there was already under way a re- 
search program carrying the appli- 
cations of the sociometric technique 
into new areas, namely, the identifi- 
cation of organizational structure and 
the processes of communication and 
control within this structure (the 
studies of E. Jacobson, S. Seashore, 
and R. Weiss). 


B. Berelson’s book on Content Anal- 
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ysts in Communication is neatly 
condensed by its author to describe 
the procedures, uses, and problems 
of qualitative analysis. The weakest 
chapter in the methods section is the 
one on cross-cultural method. There 
is little consideration of the methodo- 
logical problems of dealing with sec- 
ondary rather than primary data and 
little attention to the possibilities of 
controls in the collection of primary 
data. In all fairness, however, this 
chapter is representative of the state 
of the field. Fortunately, the future 
is more promising in that cross-na- 
tional research has led to methodo- 
logical thinking and there is now in 
preparation a Unesco-sponsored re- 
port by S. Rokkan who, with H. C. J. 
Duyker, has already outlined some 
of the dimensions of the problem in 
the Journal of Social Issues, Vol. 10, 
No. 4, 1954. 

The final chapter in the research 
methods section by D. O. Hebb and 
W. R. Thompson is on the ‘Social 
Significance of Animal Studies."’ Par- 
adoxically, it has as much, if not 
more, to offer to the social psycholo- 
gist as any of the chapters in the two 
volumes in its wise counsel, in its per- 
spective, and in its insights. I¢ af- 
fords the stimulation one would ex- 
pect from the chapter on “Culture 
and Behavior.”’ ‘The social psychol- 
ogist can study with profit the de- 
velopment of different order concepts 
for the description of animal be- 
havior—descriptive categories which 
grow out of intensive preoccupation 
with an understanding of the phe- 
nomena being investigated. In brief, 
this chapter on animal studies is a 
strong argument for the inclusion of 
a biologically oriented approach to 
the problems of social psychology. 

The second volume deals with so- 
cial interaction and applied social 
psychology. Chapter 18, ‘Socializa- 
tion” by I. L. Child, and Chapter 26, 
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“National Character’ by Alex 
Inkeles and D. J. Levinson, are par- 
ticularly impressive. Both of these 
accounts are thoughtful in their or- 
ganization and interpretation and re- 
sourceful in their suggestions for fu- 
ture research. Child, without sacri- 
ficing rigorous standards, faces fully 
the contradictory findings and the 
weakness of the research methods in 
some of the studies, and still makes 
maximum sense of his material. Re- 
search on the socialization process is 
in a better position for advances be- 
cause of this chapter. Inkeles and 
Levinson, in their brilliant account, 
develop fully the implications of na- 
tional character both as an _ inde- 
pendent and as a dependent variable 
in social psychology. This is a strong 
chapter methodologically in its speci- 
fications for common, standard em- 
pirical categories of observation, and 
in its promising framework for future 
research. Moreover, it comes closer 


than any other chapter in adequacy of 


treatment of social structure variables. 
One of the classic chapters in the 
original Handbook of Social Psychol- 
ogy, edited by C. Murchison, was 
Dashiell’s summary of the experi- 
mental studies of group effects. This 
topic is now covered by two chapters 
which ably represent the new concep- 
tualization of group variables and the 
new types of group experiments 
Chapter 21, “Experimental Studies 
of Group Problem Solving and Proc- 
ess’ by H. H. Kelley and J. W. 
Thibaut, and Chapter 22, ‘“Psycho- 
logical Aspects of Social Structure” 
by H. W. Riecken and G. C. Homans. 
The work of Kelley and Thibaut is a 
substantial contribution and, al- 
though it neglects some historical 
origins, is the most comprehensive 
and understanding account of the 
experimental literature available. 
The pieee by Riecken and Homans 
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is limited but ingenious. It isan in- 
teresting attempt to reduce group 
activity to a few simple principles, 
most of which are generalizations 
from individual psychology. For 
example, the authors offer the hy- 
potheses: (a) a member of a group 
chooses or likes another member to 
the degree to which the latter's ac- 
tivities realize the former’s norms 
and values, (b) the more one member 
likes another the more he will origi- 
nate interaction with him, (c) the 
liking of one member for another is 
the product of the amount of interac- 
tion and the importance of the second 
member's activities in terms of some 
group norm. Basic to much of the 
thinking in this chapter is the old 
doctrine of self-consistency. Granted 
the validity of the concept, many 
social psychologists may want to 
know how much can be expected from 
this concept to help in an under- 
standing of group process and group 
structure. 

In keeping with sound tradition 
there is the usual anthropological 
piece on culture and behavior. This 
time it is presented on a grander 
scale than usual, albeit in somewhat 
verbose fashion. It is essential that 
social psychologists should be con- 
stantly reminded of how culture- 
bound their assumptions, concepts, 
and methods can be. At some point, 
however, there should be hope of 
systematic substantive findings in 
this area as well as the expression of 
a point of view. C. Kluckhohn has 
taken a forward step in not only ad- 
mitting the possibility of universal 
behaviors but of urging their study. 
The theoretical position of the article 
is difficult for the psychologist to fol- 
low. On the one hand we are told that 
anthropologists and _ psychologists 
start from data of the same order, 
namely, human behavior, and that 
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they may even start from the same 
particular data. The differences come 
at the level of abstraction and con- 
ceptualization. This is a clear and 
accurate statement. But then we are 
told that we should consistently and 
explicitly recognize the interdepend- 
ence of cultural and psychological 
phenomena. Here is the source of the 
real confusion. If anthropology is on 
a different level of interpretation 
than psychology of the same phe- 
nomena, then we can not talk about 
the interdependence of the phenom- 
ena. If one observer conceptualizes 
the flight of soldiers in battle as the 
ascendance of fear motivation and an- 
other sees it as the decline of an em- 
pire, we can not put these interpreta- 
tions into the same system of data as 
interdependent variables. This con- 
fusion between the level of data and 
the level of concepts limits the chap- 
ter in its attempts at integrating cul- 
ture and behavior. 

The section on applied social psy- 
chology covers four topics: prejudice 
and ethnic relations, effects of mass 
media, industrial social psychology, 
and the psychology of voting. These 
chapters maintain the high level of re- 
porting and organization of the //and- 
book. It is most helpful to have an 
article on industrial psychology which 
sees the problems as field problems in 
social psychology, as does M. Haire’s 
sophisticated and stimulating pres- 
entation. Haire rightly fingers so- 
cial organization as the focal point 
in industrial psychology, but he does 
not take full advantages of the studies 
and thinking of the workers in this 
field (Barnard, Simon, Whyte, Wor- 
thy, etc.). The Hovland chapter on 
mass media is a welcome addition 
to social psychology. It brings 
together material which otherwise 
might escape the social psychologist. 
It is well organized and research ori- 
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ented in approach so that the inter- 
ested reader is stimulated to do ex- 
periments to fill the indicated gaps. 

There are two major omissions in 
the comprehensive list of chapters, 
and even these may be considered a 
reflection of the present state of our 
discipline rather than an error of edi- 
torial planning. The first major 
lack is the absence of any systematic 
treatment of social-structure varia- 
bles. That small face-to-face groups 
affect behavior is recognized and at 
the other extreme there is a chapter 
on culture and behavior. But be- 
tween these extremes there is no ac- 
count of the major forces affecting 
the lives of people save as they are 
peripherally treated in chapters de- 
voted to some other topic. The most 
elementary fact about any society is 
that behavioral relationships are 
structured and patterned in the form 
of social organizations, institutions, 
and groups. Power is exercised as a 
function of existing social systems, 
and people live, struggle, fail, and 
prosper in relation to the social struc- 
ture of which they are a part. Social 
behavior does not occur in a vacuum 
but even the chapter on role behavior 
treats social roles as if they existed 
apart from the arena of group strug- 
gle and group functioning. The chap- 
ter which might be expected to deal 
with this material, ‘Psychological 
Aspects of Social Structure,” is con- 
cerned only with small groups, and 
its attempts at general principles take 
no account of the nature of social 
structure. Social class, stratification, 
the exercise of power, the analysis 
of institutions, the mechanisms of 
social control, social movements, and 
change are neglected. This neglect 
is in part due to the heavy phenom- 
enological bias of present-day social 
psychology and, correspondingly, of 
the authors, 
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The second major omission is the 
failure to include a consideration of 
the relationship between personality 
variables and social variables. Many 
of the pages are devoted to reducing 
social problems to personality dimen- 
sions. Many group effects, for exam- 
ple, are reduced to the individual’s 
striving for self-consistency. The fit 
between personality tvpe and social 
structure is described in the chapter 
on national character, but other as- 
pects of the interaction effects of 
personality and social variables are 
not explored. In other words, we 
have perpetuated the old dichotomy 
of approaches: either all individuals 
are affected similarly by group condi- 
tions or all group effects are explained 
as the expression of personality mech- 
anisms. If social psychology has any 
unique subject matter, it may well lie 
in this neglected area of the interac- 
tion effects of personality and social 
settings. 

The publication of a handbook 
makes appropriate some observa- 
tions about the development of a 
field in more global terms than is or- 
dinarily the case. When the hand- 
book represents fairly accurately, as 
does the present one, the principal 
areas of effort and the major theoreti- 
cal approaches, this assessment is 
greatly facilitated. If one were to 
compare present-day social psychol- 
ogy with the state of the field when 
the Murchison handbook appeared 
in 1935, many changes would be obvi- 
ous. In the first place there has been 
a marked shift from a behavioristic 
approach to field theory and to 
Freudian theory. Perception and 
cognition have become central topics 
for many investigators, and where 
there is an attempt to go beyond phe- 
nomenology, the recourse is to Freud- 
ian mechanismsand dynamics. Inthe 
second place there has been a growth 
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of techniques and methods which are 
appropriate to social-psychological 
problems. There has been a corre- 
sponding increase in the quantity and 
quality of research and experimenta- 
tion. Moreover, social-psychological 
research has on numerous occasions 
become a collective enterprise with 
many large-scale projects utilizing 
the cooperative efforts of numerous 
workers. In fact, some of the chap- 
ters in this handbook have involved 
a group of colleagues in their prepara- 
tion. Then too, many new subject 
matter areas have been investigated, 
including political and economic be- 
havior. 

Somewhat less obvious among the 
changes is the origin of research and 
experimentation. The older social 
psychology started from everyday 
problems and attempted to account 
for the variance of behavior in these 
situations; experiments were more the 
miniature of the practical problem, 
as in the work on prestige suggestion 
or the accuracy of jury judgments. 
Today these practical interests con- 
tinue, but in addition the springboard 
for research is more often an hypothe- 
sis based upon some theoretical in- 
sight or attempt at derivation from a 
theoretical model. This is undoubt- 
edly a healthy sign in the growth of 
a science. But in the effort to avoid 
framing a problem in applied terms, 
it is not necessary to divorce theoreti- 
cal formulation from the data of our 
field. The chapter by Hebb and 
Thompson suggests the importance 
of intensive concentration upon the 
phenomena under study. The cur- 
rent trend away from rank empiri- 
cism is sound, but it can lead to an 
overemphasis upon exercises in for- 
mal logic at the expense of discovery 
of significant relationships in social 
phenomena. Another cause for con- 
cern in present trends is the lack of 





SPECIAL REVIEW 35 


thorough and sustained exploration 
of a related set of problems so that we 
can accumulate knowledge in an area 
and be in a position to make definitive 
statements about it. Thus far too 


much effort has been expended at de- 
fining or redefining a variable and 
demonstrating that it is operative at 
something better than a chance level. 


Finally, what is still needed is a the- 
oretical system which could handle 
the problems of social relationships 
and structure, not as they happen to 
be reflected in the psychological field 
of the individual, but as systems of 
events in the social field. 


Received February 1, 1955 
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Hocnu, Pau H., & ZuBIN, JOSEPH. 
(Eds.) Depression. New York: 
Grune and Stratton, 1954. Pp. 
ix+277. $5.50. 


Perhaps the main value of this 
book is that it brings together a wide 
assortment of material related to de- 
pressions and allied conditions. It 
consists of 16 independent papers 
originally presented at the 1952 
meeting of the American Psycho- 
pathological Society. Represented in 
the symposium are theories and find- 
ings from the fields of clinical psychi- 
atry, psychodynamics, anthropology, 
endocrinology, biochemistry, biomet- 
rics, and hospital administration. 
Attention is given to depressive 
reactions in children, the aged, 
cancer patients, and soldiers exposed 
to isolated Arctic conditions. Most 
of the articles are reviews of previ- 
ously published work. As might be 
expected from a symposium of this 
nature the individual chapters vary 
markedly in quality. Kallmann’s 
analysis of genetics in manic-depres- 
sive psychosis, Zubin’s review of 
biometric methods in psychopathol- 
ogy, and Steinbrook’s cross-cultural 
evaluation of depressive reactions are 
outstanding. Some useful criteria for 
the differential diagnosis of manic- 
depressive patients are contained in 
the chapter by Lewis and Piotrowski. 
Kalinowsky’s enthusiastic defense of 
electric convulsive therapy conveni- 
ently ignores the well-established 
finding—mentioned in an_ earlier 
naper—that the present irrational 
methods of shock therapy, at best, 
simply hasten the improvement of 
those who would improve anyway. 
It may be true, as maintained by 
Spitz, that institutionalized infants 
deprived for several months of emo- 
tional supplies normally provided by 
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the mother undergo progressive de- 
velopmental decline, but the absence 
of adequate experimental controls 
leaves room for doubt. Might it not 
be that infants kept in institutions 
for long periods are biologically less 
favored than infants who are adopted 
or placed in foster homes after a brief 
stay in an institution? 

Much still remains to be learned 
about depressions but in focusing at- 
tention on and in providing a wide- 
range view of the problem the APS 
has performed a useful service. 

JAMEs D. PAGE 

Temple University 


SAVAGE, LEONARD J. The founda- 
tions of statistics. New York: 
Wiley, 1954. Pp. xv+294. $6.00. 


Only the title of this provocative 
book will be easy reading for most 
psychologists. By “foundations of 
statistics” the author mainly refers to 
the axiomatic and logical bases of 
probability theory. After a brief in- 
troductiory chapter which notes three 
major points of view about the inter- 
pretation of probability (objectivis- 
tic, personalistic, and necessary), a 
personalistic or subjectivistic theory 
of decision in the face of uncertainty 
is developed in formal manner in- 
volving an assortment of definitions, 
postulates, theorems, lemmas, and 
corollaries. The theory is shown to 
imply the construct of utility in eco- 
nomic behavior. 

The second half of the book is de- 
voted to the impact of personalistic 
probability on minimax theory, the 
theory of games, point estimation, 
interval estimation, and the testing 
of statistical hypotheses. Overcon- 
scientious appliers of common pro- 
cedures in statistical inference may 
be somewhat traumatized by the 
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large component of arbitrariness, 
vagueness, and crudity attributed to 
the rationale of these procedures. 
LEONARD S. KOGAN 
Institute of Welfare Research, 
Community Service Society of 
New York 


RosBAcK, A. A. Destiny and motiva- 
tion in language. Cambridge, 
Mass.: Sci-Art, 1954. Pp. 474. 
$10.00. 


This is a book of many ideas about 
language. It will be read with more 
profit by the layman than by the 
scholar or scientist because it is writ- 
ten by an amateur (using this word 
in its exact sense) rather than by an 
expert philologist or linguist. 

Dr. Roback has been interested in 
language since the age of five when he 
asked his teacher whether the P’riz- 
zim were the same as Parisians. Had 
it not been for rebuffs from his lan- 
guage professors at McGill and from 
Miinsterberg at Harvard, Roback 
might have become a philologist or a 
researcher in the field of psycholin- 
guistics. In writing this book the 
author has returned to an early in- 
tellectual passion. Appropriately, 
the book bears the imprint of adoles- 
cent exhibitionism and enthusiasm 
for ideas more than it does the mark 
of mature and discriminating scholar- 
ship. I found it delightful reading. 

Destiny and Motivation in Lan- 
guage is a collection of 22 essays on 
a wide variety of subjects relating to 
language. I like particularly the 
chapters on destiny in names, shmoo 
and shmo, and the psychology of 
slang. Brutus was destined to be a 
brute, Boccaccio to be foul mouthed, 
Kierkegaard to be a gloomy grave- 
yard existentialist, Hitler (Schickl- 
gruber) to be a grave sender, Sir 
Russell Brain to be an authority on 
the brain, and the writer of this re- 
view to be bald since Calvin means 
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bald. (He is bald!) This is the sort of 
whimsical, tongue-in-cheek psycholo- 
gizing that people engage in after a 
few drinks and rarely take seriously. 
Perhaps they should! 

“Shmoo and Shmo”’ is a charming 
essay on the linguistic connection be- 
tween the name given by Al Capp to 
his fabulous little cornucopian crea- 
tures and the feminine form (shmooe) 
of the Yiddish word for the erect 
penis (shmock). Shmock is a hard 
sounding word; shmooe, shmoo, and 
shmo are soft sounding. This in es- 
sence is Roback’s vocosensory theory 
of the origin of language. The mus- 
cular pattern made by the mouth plus 
the sensory attributes of saying the 
word contain the meaning of the 
word. Words sound and feel like the 
things they refer to. Shmock sounds 
and feels like an erect penis, shmoo 
like a soft comforting vagina. Al 
Capp (born Caplin) unconsciously 
selected the feminine form of a com- 
mon Yiddish word, which he must 
have heard asa child, as the name for 
his supernurturant birds This essay 
reminds me of the perspicacious writ- 
ings of Kenneth Burke on the rela- 
tion of letter and word sounds to 
meaning. (How many psychologists 
are familiar with Kenneth Burke's 
game with words which he calls 
‘joycing’’ after the greatest modern 
wordmaster of them all?) 

In my opinion, the most valuable 
chapter in Roback’s book is the one 
on slang. In this essay, much of 
which was written forty years ago, 
Roback manages to sustain a line of 
thought long enough to reach some 
pretty interesting conclusions. One 


such conclusion is that slang is the 
rebel’s form of expression, whether 
the rebel be an adolescent, a criminal, 
or a member of a persecuted minority. 

Although this book is not an im- 
portant contribution to the psychol- 
ogy of language, it will be read with 
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interest by people who like the stim- 
ulus of fresh, off-beat speculation. 
Dr. Roback has a keen eye for the 
ironical, an affectionate feeling for 


words, and a highly permissive atti- 


tude toward ideas. It is pretty appar- 
ent that Destiny and M.-tivation in 
Language was undertaken as a labor 
of love. It deserves to be read and 
criticized with the same emotion. 
Cavin S. HALL 
Western Reserve University 


Cooper, Linn F., & ERICKSON, 
Mitton H. Time distortion in 
hypnosis. Baltimore: Williams & 
Wilkins, 1954. Pp. ix+191. $4.00. 


In this provocative little book the 
authors present the results of various 
investigations of hypnotically in- 


duced distortions of time perception 
and their clinical applications. Those 
of the readers who have not had the 
opportunity to read Cooper's earlier 
publications on this topic will find 
this book to be, for the most part, 


pleasant and fascinating reading. For 
those who are familiar with the afore- 
mentioned papers this monograph will 
be somewhat disappointing since the 
greater portion of its bulk duplicates 
the earlier reports. Nevertheless, a 
fair amount of previously unpub- 
lished experimental data has been 
added and the clinical material is en- 
tirely new. In addition, the material 
dealing with basic concepts and 
methodology represents an important 
expansion of the earlier work, and 
this should prove valuable to experi- 
menters interested in this area. 
Essentially, the subject matter 
deals with slowing down a person's 
perceived or subjective time through 
the use of hypnotic suggestions and 
the effects upon various mental func- 
tions. Data are presented which in- 
dicate that some functions may be 
speeded up. It is to be regretted that 
Cooper and Erickson have preferred 
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to cover a relatively wide range of 
functions in an exploratory manner 
rather than limit themselves to a les- 
ser number of more definitive stud- 
ies. The consequence has been that, 
although the reported data are super- 
ficially convincing, closer inspection 
tends to leave us wishing for some- 
thing more substantial. There is a 
serious lack of objective measures 
and quantified data, and many of the 
reported experiments tend to be 
methodologically oversimplified. The 
book would also have been more 
satisfactory if the authors had made 
a greater attempt to relate their work 
to current psychology. On the other 
hand, it should be kept in mind that 
the topic is admittedly a difficult one 
to deal with experimentally, and that 
Cooper and Erickson are essentially 
pioneering in a new area of investiga- 
tion. Taken asa totality the reported 
data do not permit us to categorically 
ignore hypnotic time distortion and 
further investigation is clearly war- 
ranted. 

Cooper devotes some space to the- 
orizing about a general mechanism 
for the action of hypnotic suggestions. 
In a somewhat laborious and circui- 
tous manner he reaches the conclusion 
that hypnosis imbues words with a 
particular effectiveness to evoke spec- 
ified experiences via their associated 
“meaning-tone.”” This is also the 
mechanism responsible for time dis- 
tortion. In this reviewer's opinion, 
this conclusion hardly seems to eluci- 
date the main question and leaves 
many other questions unanswered. 

In a sense the clinical material re- 
ported by Erickson may be the most 
convincing evidence for the reality of 
time distortion. The use of the latter 
seems to be potentially promising for 
brief psychotherapy and for the 
treatment of obdurate cases. It also 
brings to the fore certain temporal 
aspects of the process of therapy 
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which have been largely ignored to 
date. 

In final evaluation, the reviewer 
believes that, in spite of certain short- 
comings and its rather specialized 
nature, this monograph will be of 
interest to a wide group of readers. It 
represents an original attack upon 
the problem of temporal perception 
which cuts across major areas of psy- 
chology. The experimenter will dis- 
cover in it both a new tool and a rich 
field for further research. Theoreti- 
cians, particularly those with a philo- 
sophical turn of mind will find food 
for thought and for speculations. 
Finally, it gives the clinician and 
therapist an intriguing new approach 
to the problems of psychotherapy. 

ANDRE M. WEITZENHOFFER 

University of Michigan 


McKEon, RICHARD, et al. Jmnterrela- 
tions of cultures. New York: 
Columbia Univer. Press (Unesco), 


1953. Pp. 387. $2.50. 


This little book deals with eight 
cultures, particularly their historical 


origins and value systems. ‘‘Inter- 
relations’ are treated mainly in his- 
torical terms except in McKeon’s in- 
troductory essay and the final brief 
statement by a Unesco committee. 
The authors are predominantly his- 
torians and philosophers, and these 
points of view are central in the dis- 
cussion. Indian “psychology” is 
given a few pages (pp. 134 ff.), but 
they will not appeal to most academic 
psychologists in the United States. 
Griaule makes some very suggestive 
remarks about cognition and learning 
among African Negroes. There are 
some excellent—and to this reviewer 
quite fresh and new—treatments of 
Indian, Chinese, and African Negro 
esthetics. Anthropology enters prin- 
cipally into the consideration of 
African cultures; it gets short shrift 
on Japan (p. 121). 


357 


As always with volumes of this 
sort, the contributions vary greatly 
in interest and quality. The two 
pieces on American culture are ex- 
tremely thin. On the other hand, Zea 
on Spanish-American culture is pro- 
found and provocative. Atreya on 
India gives a more compact and in- 
telligible account of central themes 
than the reviewer has ever previ- 
ously encountered, Chatterji’s char- 
acterization of Indian culture also 
has force and point, though one feels 
he strains to present it in a manner 
that will fit best with certain currents 
in Western thought. 

In general, this volume will attract 
only those psychologists who are in- 
terested in the cross-cultural study of 
values or those who enjoy somewhat 
discursive essays seasoned with oc- 
casionally exotic facts. Yet the best of 
these chapters are extraordinarily 
thoughtful syntheses of vast bodies of 
material, and Unesco deserves thanks 
for making them available. 

CLYDE KLUCKHOHN 

Center for Advanced Study in the 

Behavioral Sciences 


ZISKIND, EUGENE. Psychophysio- 
logic medicine. Philadelphia: Lea 
and Febiger, 1954. Pp. 370. $7.00. 
Here is a book written for the 

general medical practitioner that will 
interest psychologists more by its 
purpose than by its content. Its 
premise is that one-third of the pa- 
tients seeking medical aid have symp- 
toms solely or primarily due to psy- 
chogenic pathology and another third 
combine psychogenic with physio- 
genic pathology. The requirement for 
psychodiagnosis and psychotherapy 
in the treatment of general medical 
disorders is clear indeed. 

Dr. Ziskind, head of the depart- 
ment of psychiatry and neurology at 
the Cedars of Lebanon Hospital in 
Los Angeles, has been coordinating a 





358 


training program for physicians to 
meet this need. He drew heavily on 
this experience in preparing the por- 
tions of this book dealing with diag- 
nosis and therapy. 

The book consists of four parts. 
The first one provides background 
and orientation to the problem. The 
second part is concerned with diag- 
nostic and psychotherapeutic tech- 
nique. The third part consists of a 
review of familial and other social 
factors in psychopathology, plus a 
general treatment of psychiatric syn- 
dromes and common psychiatric 
emergencies with which the physician 
may be confronted. The fourth part 
includes a useful review of contem- 
porary schools of psychiatric thought. 

The writing style is easy and lucid. 
In keeping with the aim of the book, 
there is an almost “do it yourself”’ 
quality to it. This is one of its short- 
comings. Skill, personality, and 
character requirements for effective 
psychotherapy are stated as precepts, 
as if the reading by the physician is a 
sufficient fulfillment of the require- 
ments. The requirements themselves 
are not at issue. Ziskind states that 
the personality and attitude of the 
therapist must be encouraging. He 
should be kindly, sympathetic and 
interested, permissively nonmoralis- 
tic, nonjudgmental, and nonauthori- 
tarian (p. 149). 

Within the framework of a super- 
vised training program at a hospital 
there is some modest assurance that 
these requirements may be met. As 
a minimum, a physician lacking these 
characteristics can be discouraged 
from further participation. Ziskind 
reports that the hospital program has 
been highly successful, and it may be 
assumed that the success is due in no 
small measure to the availability of a 
well-trained psychiatric staff. 

For a training program of this type 
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this book would be a useful syste- 
matic guide. However, the book is 
addressed to the practitioner at large. 
The wisdom of its use independently 
of supervision will be seriously ques- 
tioned. Ziskind believes that because 
psychophysiologic illness is so ram- 
pant, the physician is obliged to deal 
with it as best he can anyway and 
after reading this book the physician 
can approach this task with less 
anxiety. 

What is at issue is whether the 
patient is better off if he is exposed 
to the risk of therapeutic mismanage- 
ment. This is the dilemma posed by 
Ziskind's effort. The merits of his ef- 
fort are so obvious that it would be 
highly valuable to assess the magni- 
tude of the risk empirically even 
though this may be a large under- 
taking. 

Against such data as a background, 
other solutions may also be consid- 
ered. These would include the over- 
due modification of medical school 
curricula and the establishment of a 
more cooperative working relation- 
ship between the medical and psycho- 
logical professions. 

JoserH E. BARMACK 

College of the City of New York 


Murpuy, G. & BACHRACH, ARTHUR 
J. An outline of abnormal psychol- 
ogy. (Rev. Ed.) New York: Random 
House, 1954. Pp. ix+597. $1.45. 


This is a revision of Dr. Murphy’s 
1928 contribution to The Modern 
Library Series and almost everything 


but the title has been changed. Dr. 
Murphy gives most of the credit for 
the selection of the material to his co- 
editor. The first chapter is a Gardner 
Murphy original. In addition to 
chapters on the dynamics of schizo- 
phrenia, psychoneuroses, and para- 
noid disorders, there are chapters on 
specific topics such as adolescent 
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conflicts and adjustment, obesity, 
the addictive drinker, psychological 
factors in epilepsy, the individual and 
society, Rorschach variables and 
ESP scores, mongolism, and the 
treatment of the withdrawn child. 
The editors have selected published 
material which has breadth, reader 
importance, technical accuracy, and 
simplicity of report. The unity of the 
book is striking, in view of the wide 
range of subject matter and the 
sources of the papers included. An 
appropriate and helpful giossary is 
appended. The volume would have 
been improved by an index, espe- 
cially since it is suitable for college 
classes. 

This is not just another book on 
abnormal psychology, as the title 
may suggest, but a collection of read- 
ings which may be used either to sup- 
plement existing texts, or alone, as an 
aid to stimulating insight and under- 
standing in the searching layman. As 
psychology’s role in the community 
continues to be enlarged, there will 
be more and more need for authentic, 
readable, and relevant books such as 
this for the average reader. 

FRED MCKINNEY 

University of Missouri 


Homer, HARRY. Language in culture. 
Chicago: Univer. of Chicago Press, 


1954. Pp. xi+286. $4.50. 


The late Benjamin Whorf’s dramat- 
ic presentation has interested a large 
part of the intellectual world in the 
hypothesis of “linguistic relativity.” 
Whorf maintained that the cognitive 
style of a people is in correlation 
with, and perhaps determined by, 
the lexicon and structure of the 
language spoken by that people. In 
1953, at the University of Chicago, a 
group of scholars representing linguis- 
tics, anthropology, psychology, and 
philosophy discussed for one week the 
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“relativity’’ thesis cf Whorf. The 
report of the conference comprises 
seven papers and nine discussion 
sessions. 

With Whorf, as with Freud, no 
reading, however careful, can tell us 
exactly what the author meant if ex- 
act meaning implies operational defi- 
nition. The exegesis of Whorf is a 
principal topic of the conference. 
This portion of the discussion may 
seem to be wasted since the problem 
of Whorf's exact intentions is, after 
all, a rather trivial historical ques- 
tion. In fact, however, each dis- 
cussant takes this opportunity to 
propound his own views and receive 
criticism of them, letting Whorf 
absorb the odium for whatever may 
be said that cannot survive criticism. 

Whichever of several things Whorf 
meant to say the conference members 
generally agree that his data (largely 
from his study of Hopi) are suggestive 
rather than conclusive. ‘Linguistic 


relativity’ is a collection of hypoth- 


eses about language, culture, and 
cognition. The conference accord- 
ingly gives some time to the design 
of research intended to test these 
hypotheses. Hoijer makes an in- 
teresting proposal for comparing 
general cultural similarities with lin- 
guistic affiliations among the 
Navaho, Hopi, and other American 
Indians. Hockett gives the best feel- 
ing for method and also provides 
valuable examples in his comparison 
of certain limited features of English 
and Chinese. In a final discussion 
research ideas are rather carelessly 
tossed out. Several of these are 
promising (notably those of Louns- 
bury and Kaplan) but the reader is 
disappointed by the failure of the 
conference to think through any one 
idea to a point where its practicabil- 
ity and merit can be judged. 
Hoijer has condensed and edited 
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some 919 pages of typescript to ob- 
tain the 150 pages of discussion that 
appear in the book. These discussions 
read smoothly and are generally 
intelligible. However, the trans- 
cribed diaglogue of ideas never has 
the sustained direction and thought- 
fulness of a literary construction. 
The unfortunate fact is that not even 
every other idea expressed in a con- 
ference is worth recording. As a 
result this book makes an extremely 
uneconomical presentation of a small 
number of good ideas. 
RoGer W. Brown 
Harvard University 


K.Loprer, B., Atnswortu, M. D., 
K.Loprer, W. G., & Hort, R. R. 
Developments in the Rorschach tech- 
nique. Vol. |. Technique and the- 
ory. Yonkers, New York: World 
Book, 1954. Pp. x +726. $6.50. 


There is something for everyone in 
this first volume of a two-volume 
work on the Rorschach. Probably the 
primary use of the book will be as a 
teaching manual for students learn- 
ing the mechanics of scoring and the 
feeling for interpretation, but even 
experienced users of the technique 
may read with profit the sections on 
theory and interpretation. 

The sections on administration 
and scoring are written clearly and 
numerous examples are provided to 
clarify scoring problems. Few changes 
in the scoring system presented 
twelve years ago in The Rorschach 
Technique have found their way into 
the present procedure. There is some 
clarification of methods of scoring 
shading responses though these dis- 
tinctions are likely to continue to be 
frustrating to students. Ten popular 
responses are listed—the same ones 
as twelve years ago, and a slightly 
different list from that offered by 
other experts. The form-level rating 
procedure, an attempt to give a nu- 
merical score for each percept reflect- 
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ing its accuracy, will continue to baf- 
fle statisticians and others with a 
bias against adding, subtracting, and 
averaging cabbages and kings. 

The sections on interpretation are 
well written and will stir little contro- 
versy among experienced users of the 
technique. Some readers are certain, 
however, to criticize the failure of the 
authors to support their statements 
with data in the literature. The au- 
thors, anticipating this objection, 
present their interpretations as “‘hy- 
potheses”’ and indicate that the use- 
fulness of these hypotheses has been 
established in clinical practice and 
that they may be used ‘‘pending the 
conclusion of the extensive investiga- 
tions that will be necessary to evalu- 
ate their relative validity.’’ Appar- 
ently existing validation studies do 
not fit their criteria. Certainly clini- 
cal use has resulted in few changes in 
interpretation. The authors also in- 
dicate, wisely, that ‘‘the integration 
of the findings into some sort of 
meaningful dynamic picture depends 
largely upon the examiner’s basic 
understanding of human personal- 
ity.”” Again, there is much in the 
literature which could be used to 
document this hypothesis. 

The section on validation problems 
makes a case for the point of view 
that “there is no sharp dividing line 
between validation research and the 
clinical use of the Rorschach tech- 
nique ... indeed, the technique it- 
self is continually changing through 
the validation process.’’ A test of 
this latter hypothesis through a com- 
parison of scoring and interpretation 
procedures by the same authors over 
the years, and by an examination of 
possible gradual rapprochement be- 
tween different schools, would possi- 
bly raise some doubts here. The sec- 
tion on validation concludes with the 
statement that ‘‘validation research 
should be planned and the results dis- 
cussed by those who have more than 
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a superficial understanding of the 
basic concepts involved in Rorschach 
interpretative hypothesis. ...°’ Few 
will disagree, though the statement 
leaves a nice opening for a rejoinder 
by the statisticians! 

To single out these statements does 
not give an accurate picture of the 
generally excellent discussion of the 
problems of validation and of exist- 
ing approaches to these problems. 

Also included is a brief, though 
stimulating chapter on implications 
of contemporary personality theories, 
a somewhat more esoteric chapter on 
ego psychology, a long case study, 
and a Rorschach Prognostic Rating 
Scale (experimental) designed to pre- 
dict a patient’s response to psycho- 
therapy. This latter may be just 
what Eysenck has been looking for, 
though again it may not. It would be 
interesting to hear him discuss it 
with the authors at a symposium. At 
the end of the book, as an appendix, 
is an added chapter on Freudian and 
Jungian approaches to the develop- 
ment of ego and self. The author is 
Jack Fox, who is not represented on 
the masthead, perhaps because there 
is no mention of the Rorschach in the 
chapter. 

The book contains no general list 
of references though there is a bibli- 
ography at the end of many chapters. 
Those interested in or favoring par- 
ticular cards of the Rorschach will 
find a list of the pages on which each 
card is mentioned. 

In a work of this scope it is easy to 
find statements or points of view 
with which one can disagree. Also 
with a second volume in the offing 
one may find his objections concern- 
ing lack of some form of content 
answered with the companion vol- 
ume. There is little doubt that the 
present work will find a wide audi- 
ence and deservingly so. 

GEORGE W. ALBEE 

Western Reserve University 
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Katz, D., CARTWRIGHT, D., ELDERS- 
vVELD, S., & Lee A. McC. (Eds.) 
Public, opinion and propaganda. 
New York: Dryden, 1954. Pp. 
xx+779\ $6.25. 


The Society for the Psychological 
Study of Social Issues has produced 
an excellent book of readings on pub- 
lic opinion and propaganda. In keep- 
ing with its interdisciplinary orien- 
tation, a wide range of the relevant 
literature in political science, history, 
anthropology, sociology, economics, 
and psychology has been culled to 
provide a selection of readings (74 in 
all) showing the possibilities of the 
empirical approach to problems in 
the area as well as the societal con- 
text, the political structure, and the 
social-psychological dynamics of 
opinion formation. There is as wide 


a coverage of points of view, types of 
studies, and problems, both practical 
and theoretical, as one has a right to 
anticipate in a book of readings in- 


tended for courses in public opinion 
and propaganda. On the basis of the 
sampled advanced work included in 
the collection, one wonders, however, 
whether the general area of public 
opinion and propaganda will acquire 
that eventual “scientific” status 
which the editors suggest such work 
may adumbrate. ‘Control’ and 
‘‘number” are not enough. After all, 
Ptolemaic theory also had its systems 
of quantification. 

IvAN D. Lonpon 

Brooklyn College 


McMILLan, B., Grant, D. A., Fitts, 
P. M., Frick, F. C., McCuLtocn, 
W. S., Miter, G. A., & Brostn, 
H. W. Current trends in informa- 
tion theory. Pittsburgh: Univer. 
of Pittsburgh Press, 1953. Pp. 
xiii+188. $4.00. 

This is the seventh in a series of an- 
nual conferences on Current Trends in 

Psychology arranged by the Depart- 
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ment of Psychology of the University 
of Pittsburgh. The seven partici- 
pants and the titles of their contribu- 
tions are: McMillan, ‘‘Mathematical 
Aspects of Information: Theory”; 
Grant, ‘Information Theory and the 
Discrimination of Sequences in Stim- 
ulus Events’’; Fitts, ‘‘The Influence 
of Response Coding on Performance 
in Motor Tasks’’; Frick, ‘‘Some Per- 
ceptual Problems from the Point of 
View of Information Theory”; Mc- 
Culloch, ‘Information in the Head”’: 
Miller, ‘‘Information Theory and the 
Study of Speech’; and Brosin, ‘*In- 
formation Theory and Clinical Medi- 
cine (Psychiatry).”’ 

As is frequently the case with such 
compendia, this one is rather a hodge- 
podge with different authors trying 
to do different things in different 
ways. The unifying theme, such as it 
is, is modern communication theory. 
McMillan attempts a brief and gen- 
eral account of communication the- 


ory, in which attempt he is very suc- 
cessful. It is gratifying to know that 


mathematicians can communicate 
with psychologists if they want to. 

Grant talks about a number of ex- 
periments dealing generally with par- 
tial reinforcement and the discrimina- 
tion of stimulus frequencies occurring 
with various probabilities. Although 
the experiments are interesting in 
their own right, the attempt to relate 
the results to information theory is 
strained. It is hard to escape the feel- 
ing that it would have been much less 
cumbersome to talk about these 
studies in other terms. 

Fitts is concerned with the speed 
and accuracy of discrete and continu- 
ous movements and with stimulus- 
response compatibility. He reports 
some exceedingly ingenious experi- 
ments but it is difficult to know how 
much their conception is in debt to 
information theory. At any rate, it is 
possible to argue that the principal 
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contribution of information theory to 
these results has been to suggest the 
use of logarithms. 

McCulloch has an interesting sur- 
vey of some neurophysiological prob- 
lems of sensation and perception and 
Brosin talks about some problems in 
clinical psychology and psychiatry. 
In both cases the principal connection 
with information theory has been the 
translation of some common psycho- 
logical and physiological terms into 
some uncommon ones, e.g., ‘‘quan- 
tize,”’ “modulate,” and the like. If 
both authors had applied terms ordi- 
narily used in their respective fields, 
their discussions would undoubtedly 
have contained as much information 
and would probably have been better 
communications. 

Of all the authors only Frick and 
Miller describe some classes of psy- 
chological experiments to which in- 
formation theory has been able to 
contribute something other than a 
new vocabulary. Miller is especially 
levelheaded in his appraisal of the 
usefulness of information theory for 
psychology and a paraphrase of the 
first two pages of his contribution 
might well summarize the outcome of 
this conference: There are not yet 
many places in the study of human 
behavior where information theory 
can be profitably applied. 

Since the chances are good that you 
do not agree with the reviewer's ap- 
praisal of information theory, you 
ought to read the book yourself. It is 
worth reading. 

A. CHAPANIS 

The Johns Hopkins University 


ZiLBooRG, G. The psychology of the 
criminal act and punishment. New 
York: Harcourt, Brace, 1954. Pp. 
xi+141. $3.50. 


This short book grew out of a Yale 
lectureship under the joint auspices 
of schools of law and medicine. It 
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examines the motivation and conse- 
quences of punishment of the criminal 
and points to the ‘‘curative powers of 
healthy self-punishment which never 
becomes hostility.” 

It will make its major contribution 
to the practicing professions, but 
psychologists acquainted with the 
hypotheses reviewed here will wel- 
come their emphasis by a writer who 
is well regarded in his own and re- 
lated fields. Zilboorg presents ¢xcel- 
lent suggested courtroom ethics for 
the representatives of the behavior 
sciences. 

FRED MCKINNEY 

University of Missouri 


WOLSTEIN, BENJAMIN. Transference: 
its meaning and function in psycho- 
analytic therapy. New York: Grune 
& Stratton, 1954. Pp. xiii+206. 
$5.00. 


Trained in philosophy, psychology, 
and Sullivanian analysis, Welstein 
has given thought to the rich source 


of fantastic data brought to! light 
chiefly by methods stemming from 


Freud: dreams, free association, 
“transference” observations—-areas 
largely avoided by experimentalists. 
He also understands operationism 
and precise models of science., His 
book demonstrates well the feasibil- 
ity of bringing into scientific con- 
sciousness those analytic concepts 
which sometimes seem to have come 
up dripping with the amorphous 
illogic of their dreamworld origin. 
His review of Freud's and later 
analytic thinking suggests | that 
Freud saw transference as intantile 
behavior toward parents ‘“‘trans- 
ferred’’ to a present situation. Wol- 
stein sees transference in terms of 
observable here-and-now behavior. 
Freud pursued relentlessly the re- 
constructed memory of an event from 
early infancy which ‘‘determined”’ 
the neurosis; using first hypnotism, 
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then strong suggestion, and finally 
an authoritarian exploitation of “pos- 
itive transference.’’ Negative trans- 
ference he battered down as re- 
sistance, and  countertransference 
was not admitted to the patient. 
Wolstein accepts extrapolative re- 
construction of infantile events, but 
also emphasizes later developmental 
stages. He keeps the focus on present 
interactions with careful attention 
to the therapist as an interbehav- 
ing organism (countertransference) 
rather than as a hypnotic authority. 

The book reflects the groping 
trends of opinion developing from 
therapeutic experience and from in- 
creased awareness of the problems 
of definition. Many questions related 
to transference are brought closer to 
the status of answerable questions. 

Roy M. HAMLIN 
Western Psychiatric Institute 
University of Pittsburgh 


Wever, E. G., & LAWRENCE, M. 
Physiological acoustics, Princeton: 
Princeton Univer. Press, 1954. 
Pp. xii+454. $10.00. 


This new volume from Professor 
Wever's laboratory can be immedi- 
ately recommended to two classes of 
readers: the few who eat and sleep 
audition, and the many who need a 
clearly written text to keep them 
posted on the latest facts plus 
authoritative comment from a logical 
mind on the meaning of those facts. 

For workers already immersed in 
the area, the book provides two 
values. It brings together much of 
the material Wever and his asso- 
ciates have scattered over scores of 
papers in the acoustical, otological, 
and psychological literature; and it 
also presents new data and fresh 
comment on old data, both of the 
very greatest interest. 

The plan of the book is to examine 
the transmission of acoustic energy 





364 


from air-borne’ vibrations clear 
through the system up to but not 
including the primary neuron. After 
the three short introductory chapters, 
well over two-thirds of the remaining 
375 pages is devoted to the system 
up to but not including the cochlea. 
This emphasis on the drum, the 
ossicles, the tympanic muscles, prob- 
lems of bone conduction, of the 
relative efficiency of routes to the 
cochlea, and of clinical problems in 
sound conduction form a _ salutary 
change from a common notion that 
“auditory theory’’ concerns only 
basilar mechidiiics and associated 
neurology. There is much besides a 
dramatic concentration on cochlear 
patterns to engage the attention of 
an auditory theorist. 

This book makes clear how Pro- 
fessor Wever has focused the full 
attention of his laboratory for a 
quarter century on the whole prob- 
lem of sound conduction. Beginning 


at the eardrum and proceeding to in- 
tracochlear electrodes, every phase of 
the subject has been touched upon 
by him in some, often crucial, ex- 
periment. The result is a treatise in 
the best tradition. Certain problems 


unsolved since Helmholtz have 
yielded to Wever’s experimental tech- 
nique. A good example is his chapter 
on the eardrum as a conical lever. 
He first summarizes Helmholtz’s 
analogue of the catenary as the 
principle by which the force applied 
to the malleus is amplified, marshals 
the anatomical evidence on the 
elasticity of the radial and circular 
fibers, and then repeats Helmholtz’s 
crucial experiment with a technique 
enabling him to move the manu- 
brium of the malleus accurate to 
.0005 mm. As a result, auditory 
scientists will never again seriously 
consider the eardrum as an ampli- 
fier. 
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As an example of the care and 
logic exercised in Wever and Law- 
rence’s work, one may cite the series 
of studies on the function of the 
round window. Although Eduard 
Weber long ago had the generally cor- 
rect notion, numerous attempts by 
several modern laboratories io 
document this notion had proved 
ambiguous. It remained for Wever 
and Lawrence to concentrate their 
attack on the round window with 
quite nice techniques of blocking, 
probing, loading with an air column, 
with and without ossicles present, 
and to show in precise quantitative 
fashion the proportionate contribu- 
tions of mass and stiffness to the 
system, as they affect high-tone and , 
low-tone responses. Thus not only 
is the main function of the round 
window logically proved, but the de- 
tails are expressed in quantitative 
fashion and a rational explanation 
offered. Finally, the failure of earlier 
studies to produce conclusive results 
is very civilly given by inference, 
and a true explanation for their data 
is at the reader’s hand for the first 
time. 

The medical significance of 
Wever’s decades of work on sound 
conduction is best appreciated by 
otologists, to whom he often per- 
sonally addresssed himself and who 
have taken the opportunity to under- 
write some of his efforts. 

To psychologists, the features of 
Wever’s work of most significance 
are his discovery of the cochlear 
microphonic (with which of course 
the name of C. W. Bray is always 
connected as well), and his insistence 
on the unsatisfactory nature of 
“place’’ as the only principle of 
cochlear analysis. In his book 
Theory of Hearing he fully discussed 
the reasoning which led him never to 
neglect the fact that a nerve fiber 
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always conveys information by its 
periodicity as well as by its anatom- 
ical location, and which led him to 
assimilate the characteristics of both 
space and time in a unified theory. 
The wording here is reminiscent of 
Harvey Fletcher's space-time theory, 
but there are real differences. In 
Wever’s view a fast-acting pressure 
wave sets up a geometric vibration 
pattern on the basilar membrane, 
and this pattern is sufficient taserve 
as the intermediary between the 
oscillatory pattern of the stapes and 
the space-time pattern of nerve 
response. In Fletcher's view, and in 
that of many investigators today, the 
pressure wave serves mainly to 
initiate a second, much slower, travel- 
ing wave moving up the basilar 
membrane much like a_ vibration 
along a shaken rope, and the char- 
acteristics of this traveling wave 
determine the neural space-time pat- 
tern. 

It must be said that direct obser- 
vation and certain intracochlear mul- 
tiple-electrode studies seem to sup- 
port the existence of a relatively 
slow traveling wave. Nevertheless, 
‘ Wever and Lawrence offer the possi- 
bility of another interpretation, and 
do not regard the wave as proved 
beyond question. In addition, they 
offer a most ingenious and carefully 
controlled experiment showing that 
the cochlear pattern is very similar 
no matter whether sound is intro- 
duced by a tube over the oval window 
or by a tube over a little hole drilled 
in the apex. It is admittedly most 
difficult to explain this result by a 
slow-traveling wave, as indeed it is 
difficult to explain many of the ex- 
tremely rapid phenomena of audition 
with a slow wave of any sort. In 
these matters, Wever's theory comes 
off very well. 

Many will wonder just how and in 
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what region Wever would have the 
frequency principle for low tones 
give way to the place principle for 
high tones; Wever will simply state 
that such questions are properly 
the subject of experiment not argu- 
ment. Moreover, it may be that the 
experiment of introducing sound at 
the apex is not crucial; one can be- 
lieve, for example, that the con- 
struction of the apical end of the 
basilar membrane and associated 
spiral ligament is not conducive to 
being snapped like a rope, and that 
the pressure wave beginning at the 
apex and hitting the oval and round 
windows within about 20 micro- 
seconds, is reflected back up the 
canals and initiates a slower travel- 
ing wave at the usual end of the 
basilar membrane. Thus a sine wave 
impressed upon the fluid at opposite 
ends could well initiate two travel- 
ing waves following each other with 
only a slight phase lag, well within 
the limits of measurement, and it 
would be possible to create cancella- 
tion effects quite generally through- 
out the canals, as indeed Wever and 
Lawrence found. 

Suffice it here only to say that the 
data on the question as to the emer- 
gence of the pattern of excitation in 
the cochlea, the very voluminous 
data from several laboratories since 
the publication of his book of five 
years ago, have not caused Wever 
to change his mind as to the necessity 
for a synthesis of both principles, 
space and time. And in the discus- 
sions which are now progressing in 
the several journals, Wever, by his 
persistence and elegance in the lab- 
oratory and in the literature, makes 
quite as good a case as anyone. 

J. Donato Harris 

U. S. Naval Medical Research 

Laboratory, 
New London, Connecticut 
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HALL, CALvIN S. A primer of Freud- 
ian psychology. Cleveland: World, 
1954. Pp. xii+137, $2.50. 


Hall’s small and unpretentious 
volume is in most respects just what 
the author and publisher claim it to 
be—a relatively nontechnical, yet sys- 
tematic primer of Freud's theory of 
personality. A second claim—that 
its material is selected from and re- 
flects Freud's dynamics of normal 
rather than abnormal psychology— 
seems to be a bit overdrawn. Al- 
though Freud certainly distinguished 
between normal and abnormal states, 
it is equally certain that he did not 
invoke different dynamic constructs 
to account for each of them. How- 
ever, the structure of Hall's book is 
not vitiated by the artificial dichot- 
omy of dynamics which he intro- 
duces, nor is its value lessened as a 
singularly clear, comprehensive, and 
unadorned account of essential 
Freudian theory. 

In his introductory chapter Hall 
gives a brief and informal picture 
of the steps which led Freud from 
biological research to psychology. 
Apart from this account in which his 
admiration for Freud, the Scientist, 
is frankly explicit, the author's 
own views are never obtrusive. They 
are in fact, hardly discernible. This 
anonymity of “reporting’’ is one of 
the many positive features of the 
Primer. Hall's careful organization 
of topics, his succinct introduction 
to each major new one, and his brief 
but precise summaries should make 
this book a prototype. 

The lay reader will get a reason- 
ably good overview of Freudian 
theory from the book and will be 
assisted further by the list of recom- 
mended readings furnished him. For 
the student, the book should provide 
an excellent frame of reference for 
further study. The topical list of ref- 
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erences to Freud’s writings will be 
of real assistance to him. 
Dan L. ADLER 
San Francisco State College 


FAIRBAIRN, W. RONALD D. An ob- 
ject-relations theory of the person- 
ality. New York: Basic Books, 
1954. Pp. xi+312. $5.00. 


In a series of lectures delivered in 
the 1940's, Fairbairn developed what 
for him was a renovation of Freud's 
libido theory and Abraham’s ‘“‘de- 
velopmental phase’’ theory. It is 
these lectures which appear in the 
first portion of this book; in succeed- 
ing sections there appear three case 
histories which are used to illustrate 
the author’s resynthesis, and a group 
of miscellaneous papers. 

If one were to use the criterion of 
internal consistency in evaluating the 
theoretical contribution of the au- 
thor, he would come off very well in- 
deed. From lecture to lecture, Fair- 
bairn unfolds a program of interpre- 
tation which is essentially free from 
non sequitur, and carefully avoids 
the loose ends which often charac- 
terize this method of presentation. 
The premise is stated in the observa- 
tion that schizoid states are found in 
practically all psychopathological 
cases, and by implication must de- 
velop early in life. They occur, there- 
fore, as resultants of varied relation- 
ships between the person and objects 
which he seeks in response to libidinal 
prompting. Using these ‘‘object-rela- 
tions’’ as a key concept, Fairbairn 
proceeds to reorganize classical inter- 
pretations of such things as melan- 
cholia, impulse-psychology, and the 
holy trinity of ego, id, and superego. 

If one were to evaluate this New 
Look by other than consistency cri- 
teria, however, it is doubtful that the 
concensus would be so favorable. For 
many classical psychoanalysts, Fair- 
bairn will probably represent a par- 
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ticularly repugnant type of heresy in 
his disavowal of “accepted’’ basic 
principles. To the neo-Freudian, his 
dynamics may appear too far re- 
moved from the social world to be 
considered practicable. Finally, for 
those who demand of a conceptualiza- 
tion that the referents be clear, and 
that labels and names not be treated 
as entities, there will be little incen- 
tive to read beyond the first chapter. 

Among the miscellaneous papers 
in the last section there is one which 
might be read as an object lesson in 
righteous indignation. Fairbairn 
argues here that there is strong dis- 
crimination against the acceptance 
of psychoanalysis (and surrealism) 
in academic curricula, from which it 
follows that ‘‘. . . the higher the cul- 
tural value of a subject in the univer- 
sity curriculum, the less freedom of 
thought and enquiry is permitted”’ 
(p. 253). The scale by which he meas- 
ures cultural values is not described. 

Dan L. ADLER 
San Francisco State College 


SHELDON, WILLIAM H., in collabora- 
tion with Dupertutis, C. WESLEY, 
& McDermott, EUGENE. Allas of 
men. New York: Harper, 1954. Pp. 
xvi+357. $10.00. 


‘This Atlas is organized as a work- 
ing guide to be used in conjunction 
with Sheldon’s earlier volumes, The 
Varieties of Human Physique and The 
Varieties of Temperament. It 


con- 
tains somatotype photographs of 
1,175 men, selected as representative 
of the 88 somatotypes idéntified in 
the total sample of some 46,000 men 
ranging in age from 18 to 65 years. 
As the book is primarily a supple- 
ment to the earlier volumes, it is 
necessary to turn to them for de- 
tailed descriptions of the criteria for 
ratings, unless one can learn, by re- 
peated leafing through the photo- 
graphs, to identify the differentiating 
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factors between, say, a 2 and a 3 in 
ectomorphy. Each somatotype, in 
addition to photographs of several 
examples at different ages, is pre- 
sented with (a) a thumbnail sketch, 
largely in fanciful analogy to an ani- 
mal ‘‘totem”’ that is intended to char- 
acterize both the physique and its 
personality, (6) a chart showing the 
relative position of this and similar 
somatotypes, (c) a description, in- 
cluding the incidence of this type in 
the population, and the kinds of path- 
ology, both physical and mental, 
that are most characteristic of it, and 
(d) a table giving weight for height 
and age, anda chart of the H/W/ Win- 
dex by age, for purposes of evaluating 
nutritional status according to so- 
matotype. 

Appendices include a glossary; a 
set of tables for distributions of so- 
matotypes, by age, according to the 
H /</W (which may be used as a guide 
for identifying somatotypes); a des- 
cription of equipment and procedures 
(including a chart for converting 
weight and height into the H/*W/W 
index); and a table giving the case 
numbers of subjects in the Atlas 
having varying degrees of “‘gynandro- 
morphy” and ‘“‘texture,”” the two 
relatively independent variables. 

New data on this system of somato- 
typing are discussed briefly. In a 
consideration of the sex factor, 
Sheldon defends his use of the same 
standards for both sexes, and pre- 
sents charts showing the differences 
in distribution of the somatotypes 
of men and women. The females 
predominate in the areas of endo- 
morphy-ectemorphy, with no ecto- 
morphic mesomorphs, while the 
males have a greater variety and 
spread of types, the mode being high 
in the mesomorphic component. An 
Atlas of Women, a companion vol- 
ume to this one, is in preparation. 
In dealing with the age factor Shel- 
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don restricts his discussion to the 
span 18 through 65 years, but we 
find that an occasional boy under 
this age, as young as 11 years, has 
been slipped in among the illustra- 
tions. This seems to me a dubious 
procedure, because body-proportions 
change so markedly in boys between 
11 and 18 or 21 years. Lacking 
longitudinal data on the men, the 
author has made use of his subjects’ 
reported weight histories, together 
with such casual photographs as 
they could supply, to project the 
curves backward in time. Each 
somatotype has a_ characteristic 
curve, as some naturally put on more 
weight than others. ‘These curves 
seem plausible, but they need verifi- 
cation by actual repeat measures 
and assessments of the same men as 
they grow older. There is a discussion 
of the effect of nutrition, in which 
Sheldon defends his system against 
critics who have shown that such 
things as semistarvation alter the 
apparent somatotype. 

For those who are concerned with 
the problem of physique and _ its 
relations to personality and to dis- 
ease, this book is provocative and a 
challenge to further research on the 
hypotheses and assumptions pre- 
sented. For those who use the Atlas 
as a tool or guide for evaluating 
subjects, diligent study of the many 
illustrations should contribute to 
proficiency in somatotyping. How- 
ever, it would be easy for the uncrit- 
ical to overvalue the validity of 
Sheldon’s assumptions. Although at 
the outset he is careful to call his 
work a “‘pilot study’ and to point 
out the need for more validating 
procedures, the descriptions of each 
somatotype carry the ring of cer- 
tainty, e.g., of somatotype 1 3 6 he 
says, “‘Flycatchers. Frail lightly built 
birds who yet are fierce-hearted 
hunters. ... When found in the men- 
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tal hospital . . . the 1 3 6is most often 
labeled paranoid schizophremia. .. . 
But if the 1 3 6 is sometimes paranoid 
he is often heroic. For wherever 


paranoia dwells, there also is the 
home of heroic devotion” (page 45). 
NANCY BAYLEY 

National Institute of Mental Health 


Henry, ANDREW F. & SHort, JAMES 
F. Jr. Suicide and homicide. 
Glencoe, Ill.: The Free Press, 1954. 
Pp. 214. $4.00. 


Subtitled ‘Some Economic, Socio- 
logical and Psychological Aspects of 
Aggression,” this book is by two 
sociologists: Henry is from Harvard 
and Short from Washington State. 
It treats suicide and homicide as 
dependent variables and seeks to 
relate them to the business cycle, 
forms and degrees of social control, 
and aggressive tendencies against 
self or others. In five appendices they 
appraise previous research, examine 
statistical techniques for studying 
business cycles, and use these tech- 
niques to study the relations between 
cycles of suicides and of business, 
and offenses against property. 

Henry and Short conclude that 
suicide rises in depression and falls in 
prosperity whereas homicide rises in 
prosperity and falls during depres- 
sion. Both these tendencies are asso- 
ciated with social status——suicide 
with high and homicide with low 
status. As regards sociological vari- 
ables, the authors study what they 
call “strength of the relational sys- 
tem’’ defined in terms of marital 
status, urban-rural residence, and 
ecological distribution. They con- 
clude that the greater the strength 
of the relational system, the lower 
the suicide rate and the higher the 
homicide rate. 

In the chapter on psychological 
determinants, Henry and Short study 
the differential superego structures of 
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the suicidal and the homicidal and 
find that suicide is correlative with 
high guilt and homicide with low 
guilt. 

This ambitious work by Henry and 
Short aims to build on Durkheim's 
theory and the empirical research of 
more recent Americans. The inter- 
relation of the types of variables con- 
sidered by the authors seeks to solve 
questions of the association of super- 
ego structure with the combined 
effect of economic stress, social 
status, and the strength of the rela- 
tional system. Thus, for example, 
the authors find that, with status 
held constant, the relationship be- 
tween suicide, homicide, and the 
business cycle indicates that high 
status categories experience frustra- 
tion during business contractions 
while the lowest status category 
experiences frustration during busi- 
ness expansions. The vast body 
of material that Henry and Short 
seek to handle in short compass 
leaves one with the impression that 
more has been studied than has yet 
been assimilated, but that broad 
avenues of investigation have been 
opened up. Since in studies of suicide 
and homicide all associations are best 
shown in terms of rates, we are 
forced to reflect on why everybody 
in certain categories—age, marital 
status, socioeconomic status, race, 
ethnicity, religion—where rates have 
been differentially established does 
not commit suicide and homicide. 
We might then be forced to conclude 
that certain personality types are 
suicide-prone or homicide-prone and 
then would have to seek to discover 
how and why these types show up in 
different ascriptive categories and 
what differential socializing patterns 
are involved in those prone and those 
not prone. 

GEORGE SIMPSON 

Brooklyn College 


369 


LLOWENFELD, MARGARET. The Lowen- 
feld Mosaic Test. London: New- 
man Neame. 1954. Pp. 349+144 


color plates. 50s. (available in U.S. 
from Psychological Corp., 
York City) 


New 


The Lowenfeld Mosaic Test has 
been in use for tvinty-five years, 
and articles about it have appeared 
for fifteen years, but this is the first 
full-length treatment of the test. It 
has been long overdue, since the test 
is now rather widely used. This book 
will probably do for the Mosaic 
Test what the English translation of 
the ‘‘Psychodiagnostik”’ did for the 
Rorschach: it will transform it from 
an esoteric technique into a popular 
clinical instrument. And, as in the 
case of the Rorschach, some psy- 
chologists will not be too happy about 
this. 

The resemblance of this volume to 
the ‘Psychodiagnostik"’ is marked. 
Consciously, or unconsciously, Dr. 
Lowenfeld seems to have been in- 
fluenced by Rorschach in the ap- 
proach to the study of personality, 
the medium employed, the rationale, 
the intuitive appraisal of personality, 
the use of single cases as prototypes, 
and even the important elements of 
the test, such as color, form, move- 
ment, space, content, and many 
others. An important difference is 
that in the Rorschach the subject 
is faced with completed designs while 
in the Lowenfeld test, the subject 
constructs the design. 

The book is divided into two parts: 
Part I deals with the description of 
the test, including the origin, ad- 
ministration, and principles of clas- 
sification of designs produced by 
subjects. Part II describes the use 
of the test with children, subnormals, 
normals, neurotics, organics, and 
psychotics, and discusses its use in 
the study of cultural problems. 
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Diagnostic procedures are presented 
tentatively throughout, and _ the 
reader is frequently cautioned to use 
this test experimentally rather than 
as a finished instrument. In fact, a 
chapter on scoring procedures, ap- 
parently written for this book, was 
not included because the author 
does not wish to freeze scoring 
methods and norms at this point, 
since new evidences of the effects of 
cultural factors in the use and inter- 
pretation of the test are constantly 
coming in, particularly from the 
United States. The latter results 
seem to differ markedly from those 
obtained in Europe, where most of 
the early work was done. In spite of 
the cautions constantly presented, the 
impact of the “typical’’ cases and 
the large number of case descriptions 
leaves the impression that the test 
can be used with a reasonable degree 
of assurance as a diagnostic clinical 
instrument. 


The chapters dealing with the 


description of the test convey the 
impression that the designs produced 
by subjects can be readily classified 
into stated categories which lead to 


almost automatic diagnosis. On the 
other hand, the parts of the book 
which concern themselves with the 
use of the test bring the reader back 
to reality with a jolt. Warnings about 
the difficulties in classification ex- 
perienced with live cases, examples 
of borderline and ambiguous designs, 
samples of overlapping diagnostic 
possibilities for similar designs, in- 
stances of subjective interpretation, 
and conflicting reports by different 
workers abound in the many case 
samples presented, and serve to 
disabuse the reader of any hopes for 
a reasonably simple and useful tech- 
nique for personality appraisal, in 
the event that he had entertained 
such hopes. 

In spite of twenty-five years of 
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experimentation and clinical use, 
this test is still in its beginning 
stages. It is now ready for validation. 
There is no doubt about the fact that 
competent clinical psychologists will 
find this a useful projective tech- 
nique—an important addition to 
the clinical battery—if they are not 
already using it. But then, competent 
clinical psychologists can find almost 
any series of samples of human be- 
havior useful for diagnostic purposes, 
particularly one that has had con- 
siderable use, and therefore presents 
some basis for comparison of results. 
Those who insist on “respectable” 
methods of objectification, and on 
standard approaches to reliability 
and validity will, no doubt, be as 
violent in opposition to this test as 
they are toward other projective 
methods. 

The 144 separate plates, beauti- 
fully printed in color, constitute, in 
effect, approximations to norms for 
the Mosaic test, although this is 
disclaimed by the author. They add 
a great deal to the usefulness of the 
volume. 

Morris KRUGMAN 

New York City Board of Education 


BELLAK, LEopo_tp. The Thematic 
Apperception Test and the Chil- 
dren's Apperception Test in clinical 
use. New York: Grune and Strat- 
ton, 1954. Pp. x +282. $6.75. 


More than anything else this book 
gives evidence of the extensivity and 
maturity of Bellak’s clinical experi- 
ence. Both in the expository sections 
of the volume, in which the author 
elucidates his thinking about such 
items as theoretical foundations for 
projective testing and the use of 
the TAT in psychotherapy, as well 
as in the case illustration sections in 
which sample TAT and CAT stories 
are analyzed and interpreted, there 
is a richness in clinical wisdom and 
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an erudition in psychoanalytic per- 
sonality theory. However, there are 
shortcomings of the book as a manual 
for TAT and CAT interpretation, 
unless one wishes to accept Bellak’s 
interpretive procedure ‘“‘lock, stock, 
and barrel.” 

Bellak’s basic premise rests on his 
statement: ‘‘Since I believe that the 
strength of the TAT lies in its ability 
to elicit the content and dynamics of 
interpersonal relationships and the 
psychodynamic patterns, my method 
of interpretation and my scoring 
categories are primarily concerned 
with these dimensions and only to a 
small extent with the formal char- 
acteristics.”” Based on this convic- 
tion, he presents his method of analy- 
sis for the TAT using such factors as 
“main needs of hero,”’ ‘“‘nature of 
anxieties,’ ‘‘main defenses against 
conflicts,””’ and “integration of the 
ego.”” <A different, but somewhat 
similar schema is suggested for the 
CAT. The examiner is then expected 
to develop inferences following this 
type of analysis of thematic stories, 
based on some general principles that 
are proposed. 

There seems to be little doubt that, 
according to present knowledge, the 
evaluation of projective test results 
for an individual will require a clinical 
interpretation of both qualitative 


and quantitative aspects of the test 


protocol plus an integration of such 
derived hypotheses with the life 
history and/or clinical behavior of 
the individual, as a minimum. Never- 
theless, one may question the value 
of Bellak’s schema on several counts, 
not necessarily of equal significance: 
(a) There is no necessary relationship 
between his projective theory and 
his method of analysis. The bridge 
is a tenuous one. (b) No evidence 
is presented concerning the scoring 
reliability for the suggested ‘‘scoring 
categories.” Many of these appar- 
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ently require considerable clinical 
skill, and may, presumably, be evalu- 
ated differently by different exam- 
iners. (c) One may be fascinated by 
the nature of the scoring categories 
but one may also wonder what the 
relationship of each of them is to the 
observed or latent personality char- 
acteristics of the examinee. Bellak 
has frequent recourse to the phrase 
“in my experience,” but this type 
of evidence may not be convincing to 
other clinicians. (d) There are no 
clear bases for judging the “ade- 
quacy”™’ of such factors as “severity 
of superego” and “‘integration of the 
ego.’ In a sense, each examiner is 
forced to define these phrases opera- 
tionally in terms of test protocols and 
hope that they mean the same things 
in TAT material as they do in 
dreams, therapeutic sessions, etc. (e) 
While reference is made to experi- 
mental and clinical studies of TAT 
by other authors, these findings seem 
to have no observable impact on the 
question of the validity of Bellak’s 
system.” 

In all of the above, I am not pre- 
suming that results achieved with 
Bellak’s methodology lack validity. 
| only mean to point out that the 
evidence for any judgment in this 
regard is almost entirely lacking. It 
seems to this reviewer that there 
should be a clear distinction made 
between theoretical constructs, ad- 
mirable as they may be, and the 
validity of evaluative methods for 
describing or predicting specified 
aspects of behavior on the basis of 
thematic protocols, which are de- 
veloped in relation to such constructs. 
It is conceivable that the first pillar 
may be essentially sound and that 
the second, inferred and operational, 
pillar may be of questionable sound- 
ness. 


Max L. Hutt 
University of Michigan 
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COMMENT CONCERNING SAPPENFIELD’'S REVIEW OF 
ARNOLD AND GASSON’S THE HUMAN PERSON 


MAGDA B. ARNOLD 
Loyola University 


In the January number of the Psy- 
chological Bulletin’ Dr. Bert R. Sap- 
penfield wrote a review of The Human 
Person,? edited by Dr. J. A. Gasson 
and myself, in which he says: 


The reader is at once impressed by the fact 
that the authors .. . subordinate psychologi- 
cal principles to the articles of their religious 
faith—that they attempt to remodel psy- 
chology in the image of an ideology (p. 89). 


And he closes: 

The non-Catholic reader .. . is likely to find 
it disappointing since it provides... little 
more than a discussion of psychological prin- 
ciples in terms of their agreement or disa- 
greement with Catholic doctrines (p. 90). 


Lest the reader take these com- 
ments to be a statement of fact, let 
me point out that they reflect a seri- 
ous misunderstanding of our position 
as stated in the very first chapter of 
the book and adhered to throughout. 
In the first chapter we have tried to 
make explicit the various assump- 
tions on which psychological theories 
can be based (e.g., that man is only 
quantitatively different from ani- 
mals, or, that he has some capacities 
the animal does not possess). These 
assumptions either reflect the world 
view of the scientist (hence an un- 
acknowledged philosophy) or they 
may be taken from a_ particular 
philosophical system. Our own as- 
sumptions are presented as an al- 
ternative to the currently popular 
world view (see Chap. 1, pp. 11-16)? 
and are based on Thomistic philoso- 
phy. This, like any other kind of 


1 Psychol. Bull., 1955, 52, 88-90. 

*Macpa B. Arnotp & J. A. Gasson, 
The human person: an approach to an integral 
theory of personality. New York: Ronald Press, 
1954, 


philosophy, is the fruit of reflection, 
and hence has no intrinsic connection 
with Catholic doctrine. Non-Catho- 
lics or even non-Christians may be 
Thomists; one of the best known 
modern Thomists is, for instance, 
Mortimer Adler. In contrast, Catho- 
lic doctrine is based on revelation, is 
a matter of supernatural faith and 
makes no pronouncement on the mat- 
ter or method of empirical or natural 
science. Hence Catholic doctrine 
does not touch the subject matter of 
psychology, namely the psychological 
activities of the human being. 
Obviously, every philosophy can 
be called an ‘‘ideology:”’ the material- 
istic determinism on which Freud's 
system is based as well as the logical 
positivism that is the basis of much 
modern psychological theory® or the 
dialectical materialism underlying 
Russian psychology. In that sense 
Thomistic philosophy is an ideology, 
too, and Dr. Sappenfield is free to 
take it or leave it, as suits his own 
systematic position. But to imply 
that the authors of The Human Per- 
son attempt to “remodel psychology 
in the image of an ideology’’ while 
other psychologists do not, is to take 
unfair advantage of the philosophical 
naiveté of the reader. And to inti- 
mate that our particular systematic 
position cannot lead to valid scientif- 
ic conclusions, and hence is little 
more than Catholic propaganda, be- 
trays an unconscious prejudice that 
is surprising in a psychologist. 
Another misrepresentation of our 
position is apparent in Dr. Sappen- 
field's statement that in our book 
*Cf. M. H. Marx (Ed.), Psychological 


theory: contemporary readings. New York: 
Macmillan, 1951. 
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the ego is conceived to function independently 
of, and to transcend the influence of, motiva- 
tion. Thus, the authors deny determinism 
and subscribe to ‘‘free choice” or “‘free will’’— 
a conception ...that denies the potential 
predictability of human actions (footnote 1, 
p. 89). 


But in the first chapter of our book 
we take great pains to show that self- 
determination is not independent of 
motivation but acts by eliminating 
alternatives and strengthening 
one particular motive. Moreover, 
throughout the book we point out 
that self-determination does not im- 
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ply either uncaused action or un- 
motivated action or unpredictable 
action. On the contrary, self-de- 
termination is the factor that makes 
prediction possible (cf. p. 25, p. 40, 
p. 224).2 Now it is the reviewer's 
right, if not his duty, to take issue 
with the logic of our argument if that 
is at fault; in that case, he has to 
point out the defect in the logic, not 
just deny the conclusion. But to mis- 
take the authors’ position and then 
attribute to them the bad logic con- 
sequent upon such misstatement is 
hardly responsible reviewing. 
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ERRATUM 
In the article entitled ‘Estimating the chance expectancies of 
diadic relationships within a group” by R. Tagiuri, J. S. Bruner, 
and N. Kogan (this JouRNAL, 1955, 52, 122-131) there is an error 


in the second column of page 124. The sentence should read as 
follows: 


“Certain diads can occur in only one way, e.g., S;—+—S;; others can 
come about in two ways, e.g., Si 5S; could also.bbe S;  <5S;.” 


There is also an error in the last sentence of footnote 3 on page 
126. The sentence should read as follows: 


“Thus S;—4t-: S; is not symmetrical, while S,—3 % 5S; is.” 
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